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Use and abuse of ancient DNA 


Researchers in several complementary disciplines need to tread carefully over the shared 


landscapes of the past. 


one damned thing after another, but historians and archae- 

ologists spend a lot of their time trying to put those things 
into the right order. Assistance from science over the decades has 
been transformative, but not without difficulty: it took years for some 
archaeologists to be won over by radiocarbon dating. 

Now, historians and archaeologists are grappling with a new 
scientific technique. As we discuss in a News Feature on page 573, the 
genetic study of ancient DNA is exploding, and the findings are posing 
several problems. One is a need for geneticists, archaeologists, histo- 
rians and anthropologists to understand exactly how their skills and 
insights complement each other's. It is clear, for example, that although 
genetics has useful things to say about the sweep of population history, 
the more conventional disciplines provide essential context. 

Another problem is fear that simplistic takes on ancient DNA will 
mirror damaging uses of the idea of ‘culture history’ Culture history 
views the discovery of old artefacts as a proxy for the movement of 
the people who made them. According to this idea, a particular floral 
design on a pot that spread from south to north over a few centuries, 
for example, would indicate that the specific group of people that 
painted it was on the move — and carried the design with it. 

These fears are not just about scholarship. Simplistic readings of 
culture history have encouraged people with political agendas to 
falsely draw clear boundaries between the behaviour and the claimed 
territory of some ancient (and not-so-ancient) populations — and to 
infer similarities with their claimed modern equivalents. For exam- 
ple, they often refer to the work of early-twentieth-century German 
archaeologist Gustaf Kossinna, who used culture history to trace the 
supposed origins of modern Germany to the spread of Corded Ware, 
a type of ceramic found throughout central Europe in the Bronze Age. 
Kossinna’s ideas, although influential, have proved to be scientifically 
simplistic. They became notorious following their use by the Nazi 
party to legitimize its territorial goals and beliefs about the racial supe- 
riority of German-speaking peoples. 

Scholars are anxious because extremists are scrutinizing the results 
of ancient-DNA studies and trying to use them for similar misleading 
ends. Ancient DNA, for example, offers evidence of large migrations 
that coincide with cultural changes in the archaeological record, 
including the emergence of Corded Ware. Some archaeologists 
have expressed fears that the extremists will wrongly present such 
conclusions as backing for Kossinna’s theories. 

Another problem for archaeologists and historians relates to the poten- 
tial for abuse of the results of ancient-DNA studies looking at more recent 
times, such as the Migration Period around the fall of the Roman Empire 
or the era covered by the Viking sagas. They worry that DNA studies of 
groups described as Franks or Anglo-Saxons or Vikings will reify them 
by attaching misleading genetic profiles to categories that were devised 
by historians, and are not representative of how individuals viewed 


H istory might, as historian Arnold Toynbee allegedly said, be 


themselves at the time. Already, some people have picked up on such 
studies as a way to try to trace their roots to such supposed populations, to 
justify claims they havea right to some territory or other (L.-J. Richardson 
and T. Booth Papers Inst. Archaeol. 27,25; 2017). 

On the contrary, genetic and historical evidence suggests that there 
was widespread mixing during these periods, across populations and 
geography. Indeed, presented correctly alongside insights from other 
disciplines, ancient-DNA research can bea 


“Presented powerful weapon against bigotry. Studies 
correctly ’ documenting migrations can drive home the 
ancient-DNA point that present-day peoples in one area 
research can often share few genetic links with ancient 
be a powerful peoples who lived in the same place. And 
weaponagainst — when they do focus on relatively recent times, 
bigotry.” DNA projects can highlight the diversity of 


past peoples who otherwise might be seen as 
homogenous. A 2016 study of Anglo-Saxon burials, for example, found 
a mix of ancestry, with some people related to earlier inhabitants of 
England and others tracing their ancestry across the Channel (S. Schif- 
fels et al. Nature Commun. 7, 10408; 2016). 

Two recommendations can be made for the public behaviour of 
scientists and other scholars. The first: give ample credit to the insights 
of complementary disciplines. The second: refute statements that mis- 
construe what your insights actually reveal and that can be used politi- 
cally to justify disrespect, or worse, to groups of people. = 


Digital trust 


A scandal over an academic’s use of Facebook 
data highlights the need for research scrutiny. 


personal-data scandal, which has captured global public atten- 
tion for more than a week. But when the dust settles, researchers 
harvesting data online will face greater scrutiny. And so they should. 
At the centre of the controversy is Aleksandr Kogan, a psychologist 
and neuroscientist at the University of Cambridge, UK. In 2014, he 
recruited people to complete a number of surveys and sign up to an app 
that handed over Facebook information on themselves — and tens of 
millions of Facebook friends. Kogan passed the data to SCL, a UK firm 
that later founded controversial political-consultancy firm Cambridge 
Analytica in London. (All those involved deny any wrongdoing.) 
Last week, Facebook announced restrictions on data harvesting by 
third parties, including drastically reducing the kinds of information 
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that app developers can access. (It had already changed its rules in 2014 
to stop developers gleaning data from users’ friends through their apps.) 
But damage has been done: the public has good reason to be angry 
about the way in which researchers and companies have seemingly 
used personal data without consumers’ full understanding or consent. 

Where do academic researchers fit in? Handled correctly, online data 
can be a major boon to research, and the world would benefit from 
companies such as Facebook making their data more open. Ethical safe- 
guards for research that intervenes in human lives were largely set up for 
medical and psychological studies, and are often written with definitions 
that exclude Internet research. In the United States, for example, unless 
data collected are both private and identifiable, informed consent is 
usually not deemed necessary, and research requires minimal, if any, 
oversight by an institutional review board. This would include data from 
Twitter, which are by default public. Models built on anonymized Face- 
book data would also tend to be exempt. 

Kogan’s study was unusual, both in that it was done by a university 
academic for a private company he operated, and in that the data 
were passed to a third party. Yet there is a common theme behind this 
controversy and ones that preceded it — such as a study warning that 
someone’s sexual orientation could be determined from their online 
presence (Y. Wang and M. Kosinski J. Personality Soc. Psychol. 114, 246- 
257; 2018). Data were used in ways well beyond what users expected 
or intended. Bundled together and trawled by algorithms, innocuous 
data points can reveal information that users might reasonably expect 
to stay private and that might be used in ways they are not happy with. 


Guidance does exist. A number of projects are grappling with the 
ethical challenges of big data. US and European funders have supported 
efforts in this area, and have issued recommendations such as rethinking 
what counts as ‘public’ data and the need to consider a study’s potential 
harm to society, as well as to individuals. (The University of Cambridge 
is among the institutions writing guidelines for Internet-mediated 
research, after the UK Research Integrity Office issued non-binding rec- 

ommendations on the topic in 2016.) Funders 


“Innocuous should further support such efforts, and make 
data points can them better known to researchers. 
reveal personal Sticking points remain, a major one being 


that consent is often not practical when 
retrospectively accessing data from millions 
of individuals. But as outlined for biomedical scientists in the 1978 
Belmont Report, the principle of beneficence applies: researchers should 
put the good of research participants first and, with that in mind, per- 
form their own assessment of risks versus benefits. Studies should not 
be done just because the data are there. In studies that are too large to ask 
participants for consent, researchers should poll the views of samples of 
subjects and of any population that could be affected by the outcomes. 
Ethics training on research should be extended to computer scientists 
who have not conventionally worked with human study participants. 
Academics across many fields know well how technology can out- 
pace its regulation. All researchers have a duty to consider the ethics 
of their work beyond the strict limits of law or today’s regulations. If 
they don't, they will face serious and continued loss of public trust. = 


information.” 


Burger al funghi 


Mushroom -beef blends can tackle expanding 
waistlines and carbon footprints. 


the United States. But last month, the US fast-food chain Sonic 

Drive-In tweaked its menu to introduce something different: 
a burger with some of its beef replaced with mushrooms. The Signa- 
ture Slinger cheeseburger, Sonic promises, will deliver drive-through 
customers “all of the flavor with none of the guilt”. 

The claimed conscience-easing credentials of the Slinger are twofold: 
reduced calories and a smaller carbon footprint. It's targeted at those 
who wouldnt switch to a non-meat alternative, such as soya. And the 
burger does not come alone. It’s merely the latest offering from a move- 
ment that has united chefs, scientists and health and environmental 
advocates. Led by the Culinary Institute of America in Hyde Park, New 
York, with predictable support from mushroom producers, the initiative 
seeks to promote both public health and environmental sustainability, 
by replacing 100% ground beefin some foods with a blend that contains 
up to half mushrooms. It’s the biggest new foodie trend that you haven't 
heard of. Harvard University in Cambridge, Massachusetts, and the Uni- 
versity of Southern California in Los Angeles are among the institutions 
that have started to cook with it routinely. 

Does the science support the claims? 

Beef has an outsize environmental footprint. Per gram of protein, 
beef requires on average 50 times more land and produces 100 times 
more greenhouse gases than do beans and other plant-based proteins. 
Replacing 30% of the beef with mushrooms in the roughly 10 billion 
burgers that Americans eat each year would reduce emissions equivalent 
to taking 2 million cars off the road, according to the World Resources 
Institute, an environmental think tank in Washington DC. 

Still, mushroom cultivation is not a green panacea. Mushrooms are 
fussy about temperature and are usually grown inside energy-hungry 
climate-controlled sheds. A 2012 analysis reported that mushrooms 


B ig burgers and petrol-guzzling vehicles: it’s a match made in 
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have a greenhouse-gas impact almost ten times that of vegetables such 
as onions, carrots and cabbages (M. Berners-Lee et al. Energy Policy 
43, 184-190; 2012). 

Although individual consumers who choose a mushroom substi- 
tute might feel less guilty, demand for beef continues to rise along- 
side incomes around the world. As such, scientists and governments 
must continue to look for ways to reduce the impact of beef farming 
itself. Expanding cattle pasture is the main driver of deforestation in 
the Brazilian Amazon, and one-third of the world’s cropland goes to 
feeding animals — usually cattle. Research is needed to raise pasture 
productivity, improve feed and reduce methane emissions. So, too, is 
a renewed effort to combat deforestation. 

On the health front, research has shown that substituting one-quarter 
of the beefin meals with mushrooms decreases caloric intake by about 
one-third. Saturated fats also drop, and chefs can often make do with less 
salt. Tests across US school dinners suggest that these benefits seem to 
come without a negative impact, such as leaving pupils hungry. 

Inan era of expanding waistlines, this is potentially good news. Earlier 
this month, health authorities in the United Kingdom called on food 
manufacturers to voluntarily reduce the calories in processed products 
by 20% by 2024. Doing so would save nearly £9 billion (US$12.7 billion) 
and prevent more than 35,000 premature deaths due to obesity-related 
illness over 25 years, Public Health England estimated. Sonic’s blended 
burger is substantially smaller than its regular cheeseburger and would 
more than deliver on that goal, starting at around 340 calories compared 
with 580 calories for the chain's standard. 

Will it and other mushroom-blended meats succeed where health 
and environmental advocates have failed, and persuade people to eat 
less beef? Certainly, it seems a better approach than simply asking 
people to change their ways by pointing out the cost. 

But, as with zero-alcohol wine and sugar-free soft drinks, taste will 
ultimately determine consumer choice. So how does the Signature 
Slinger score? In a blind taste comparison, two Nature reporters were 
both able to correctly identify the mushroom blend. Setting aside the 
fact that both patties had the distinct taste and texture of fast food, one 
person preferred the pure beefburger, and the other the mushroom 
blend. Both agreed that if the blended burger was the future, then the 
future was better with (guilt-inducing) bacon. = 
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sentenced to seven years in prison for securities fraud. But those 

charges are not what gained him notoriety. In August 2015, 
his company bought monopoly rights to a life-saving drug that treats 
parasitic infections in people with weak immune systems. Without 
adding any value or doing significant research, the company then 
raised the drug's price by more than 5,000%, from US$13.50 to $750 
for a single tablet. 

Plenty of drug-industry insiders decried Shkreli as “not one of us’, 
emphasizing the virtues of biotechnology and innovative pharmaceu- 
ticals. lam a fan of the drug industry — I invest in promising leads and 
start-ups. But the public sees our industry as one entity, not an industry 
of good and bad players. When any executive engages in price gouging, 
itis a reflection on us all, and validates the negative 
bias against our industry. Innovators and oppor- 
tunists will go down together, unless the former 
agree to push the latter out. 

Yes, the drug industry is a business. But it is a 
business with a beautiful purpose: treating what 
ails people. As a venture capitalist, I look for ideas 
that increase value. If we succeed in finding new 
and better therapies, we should be rewarded for 
the risks and investments we make. A course of 
the hepatitis-C drug sofosbuvir (Sovaldi) can 
run, controversially, into the tens of thousands 
of dollars — but the drug took years of cutting- 
edge research to produce and cures a disease that 
otherwise requires a liver transplant, which costs 
upwards of $500,000. 

As George Merck, president of drug company 
Merck from 1925 to 1950, put it: “We try never to 
forget that medicine is for the people. It is not for the profits. The profits 
follow, and if we have remembered that, they have never failed to appear. 
The better we have remembered that, the larger they have been.” 

Passed down from Merck to executives such as Roy Vagelos, a former 
head of Merck, and George Yancopoulos, now chief scientific officer at 
Regeneron Pharmaceuticals, is a fierce belief that if you do the science 
right, you will ultimately deliver innovative drugs to patients that will 
save lives and cure disease. I feel confident that investments (some by my 
firm) in science-driven companies such as Genentech, Kite Pharma and 
Achaogen will produce results. So will a new generation of companies, 
such as eGenesis, Encoded Genomics and Vir Biotechnology, which 
work, respectively, on producing transplant organs, ways of regulating 
genes and fighting infection. It takes people with their kind of scientific 
fanaticism and high standards to hold our collective feet to the fire. 

I think there are many more George Mercks in the drug industry than 
Martin Shkrelis. But too many in the industry, driven by the demands 
of Wall Street, have shifted to a model that seeks to maximize profit 
over helping patients. Among others, Michael Pearson, former chief 


r | Nhis month, pharmaceutical executive Martin Shkreli was 
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Drug executives should 
take a Hippocratic oath 


The industry must earn patients’ trust that new medicines really are 
worth the price, says Bob More. 


executive at Valeant Pharmaceuticals, has been publicly criticized for 
jacking up prices on several drugs. And I have, unfortunately, been 
privvy to conversations in which executives planned price rises to force 
wholesalers to make larger purchases so they can hit sales expectations. 

None of us wants to believe that we are part of the problem. But as 
long as we tolerate the presence of these bad actors, we are all complicit. 

An academic colleague once asked me about violating a compa- 
ny’s patents because, he assumed, “you guys in business do that all 
the time”. No, we don't. We depend on patents. Responsibility in our 
business means following the laws that safeguard our own intellectual 
property when the time comes. 

Over the long term, mistrust raises the cost of capital and makes it 
difficult to fund worthy science. Just two weeks ago, the US govern- 
ment charged Theranos and its chief executive, 
Elizabeth Holmes, with defrauding investors. 
(She and the company reached a settlement 
without denying or admitting wrongdoing.) 

In my experience, the biggest drag on our 
stock prices is our bouts of dishonesty with Wall 
Street, physicians and patients. And, ultimately, 
it’s the patients who suffer. 

The innovations ahead of us are both breath- 
takingly effective and expensive: gene editing, 
chimaeric organs, personalized immunotherapy. 
But members of the public will not support this 
research if they suspect that they are being gamed. 
Transparency about pricing decisions, while 
sometimes painful in the short run, builds trust. 

So, what to do? When we see practices that put 
profits ahead of patient needs, we need to call foul 
publicly and not look the other away. We need to 
move away from gimmicks that raise only the price of drugs, not their 
value. People who use these tricks should be fired. Their projects should 
be shunned by venture capitalists and potential funding partners. We 
should lobby for a patent system that rewards real novelty, not more 
years of monopoly for a mint-flavoured version of an existing drug. 
(I exaggerate, but many of the reformulations used to retain rights are 
incremental improvements, at best.) Trade groups such as the Biotech- 
nology Industry Organization and PhRMA, both in Washington DC, 
should emphasize this in their codes of ethics, and members should 
hold each other to account. 

If the pharmaceutical industry is to thrive, we must all embrace the 
Hippocratic oath, whether in academia, business, finance or at the 
bench. The cost of lost trust to the drug industry is worth much more 
than manipulated short-term gains. m 


Bob More is a managing partner at venture-capital firm 
Alta Partners in San Francisco, California. 
e-mail: bmore@altapartners.com 
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Exoplanet probe 
The European Space Agency 
will launch the world’s first 
space telescope dedicated to 
studying the atmospheres of 
exoplanets, after it selected 
the mission for development 
on 20 March. The mission, 
called Atmospheric Remote- 
sensing Infrared Exoplanet 
Large-survey (ARIEL), will 
probe more than 1,000 known 
exoplanet systems. See page 
571 for more. 


Yellow-fever rise 


In response to an unusually 
severe outbreak of yellow fever, 
Brazil’s health minister Ricardo 
Barros announced on 20 March 
that the government intends 

to vaccinate the country’s 
entire population against the 
disease. Since July 2017, the 
virus has infected 920 people 
and killed 300. Officials 
recorded 196 deaths over the 
same period in the previous 
year. The decision to expand 
yellow-fever vaccinations was 
not made sooner because it is 
not without risk: one recipient 
in 100,000 develops dangerous 
reactions, such as hepatitis. 
The government hopes that, by 
April 2019, it will have reached 
the 78 million people who 
remain unvaccinated. 


Guinea worm 

South Sudan has ended the 
spread of Guinea-worm 
disease within its borders, 
leaving Chad and Ethiopia 
as the only two countries 
with ongoing spread of 

the debilitating parasitic 
infection. South Sudan has 
gone 15 consecutive months 
— longer than the worms life 
cycle — without recording a 
case, said the Carter Center 
in Atlanta, Georgia, on 

21 March; the organization 
is leading a global effort to 


ester 
Pete 


Weather costs for 2017 break records 


The costs of severe weather and climate events 
hita record high of US$320 billion in 2017, the 
World Meteorological Association (WMO) 
said in its annual state-of-the-climate report 
on 22 March. The bulk of the bill — some 

$265 billion — was attributable to damages 

in the United States from hurricanes Harvey, 
Irma and Maria. The WMO also cited the costs 


eradicate Guinea worm. The 
World Health Organization 
in Geneva, Switzerland, must 
still certify that the disease 
has been eradicated in South 
Sudan, which involves a much 
longer process. Kenya was 
deemed free of Guinea worm 
only in February, despite last 
recording a case in 1994. 


Frozen records 


Arctic sea ice reached its 
peak winter extent on 17 
March, marking the second- 
lowest maximum in the 
39-year satellite record. 

At 14.48 million square 
kilometres, this year’s peak 

is just 0.4% above last year’s 
record-low maximum extent. 
The four lowest maxima all 
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since 2005. 


happened in the past four years, 
the US National Snow and Ice 
Data Center said on 23 March. 
This winter saw parts of the 
Arctic freezing up relatively 
late, followed by warm air 
temperatures throughout much 
of the season. 


Brexit deal urged 


UK politicians have urged 
the government to make an 
early deal with the European 
Union on science. Members 
of Parliament on the House 
of Commons Science and 
Technology Committee 
warned that post-Brexit 
arrangements for research 
and innovation could become 
a bargaining chip in wider 
trade negotiations unless 


of monsoon flooding in Asia and drought in 
eastern Africa (pictured, the dry Mandrare 
River in Madagascar). Overall, 2017 was the 
third-hottest year on record, with global average 
temperatures at roughly 1.1°C above the pre- 
industrial average, according to the report. The 
nine warmest years on record have all occurred 


research talks are made an 
urgent priority. Leaving a 

deal on science until later in 

the year risks damaging the 
United Kingdoms status “as 

a science superpower’, they 
argued in a report published on 
19 March. Leading concerns 
include clarifying whether the 
government will seek to join 
the European Commission's 
next research-funding 
programme, and whether there 
will be special immigration 
arrangements for scientists. 


Whale stranding 
Roughly 150 short-finned 
pilot whales (Globicephala 
macrorhynchus) were found 
stranded on an Australian 
beach 315 kilometres south 
of Perth on the morning of 
23 March. Animal workers 
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and volunteers were able to 
rescue and release just seven 
whales back to deeper waters 
by nightfall; the rest died. 
Three whales that had been 
released returned to the beach 
the following day; they were 
humanely killed. All but two 
of the dead whales have been 
removed from the beach 

at Hamelin Bay, near the 
continent's southwestern tip. 
The area was also the site ofa 
stranding of 87 pilot whales 
and dolphins in 2009. 


Science campus 


The French government has 
endorsed an international 
panel's recommendation 

to delay for 30 months a 
decision on whether to 

give permanent ‘excellence 
status to the Paris-Saclay 
University cluster, part of the 
Saclay science super-campus 
southwest of Paris (pictured). 
The newly established cluster, 
a grouping of 14 institutions, 
has temporary excellence 
status under the government's 
IDEX initiative and receives 
€30 million (US$37 million) 
in funding a year; permanent 
status would guarantee that 
income. The cluster has been 
on probation since 2012, and 
must become an integrated. 
establishment by the 2020 
academic year, the panel said. 
The Saclay science project 
initially aimed to bring some 


TREND WATCH 


Facebook users routinely 
encounter science-related 
information in practical, 

‘news you can use posts or 

in advertisements. A study 

by the Pew Research Center 

in Washington DC analysed 
6,582 posts randomly selected 
from 30 science-related Facebook 
pages. Nearly 30% were directly 
about scientific discoveries, 
whereas 21% were ‘news you can 
use and 16% were promotions. 
The study found that visual posts, 
such as videos or images with little 
text, received most engagement. 


20 research and teaching 
institutions under one 
umbrella. But progress was 
stalled by reluctance from elite 
institutions known as grandes 
écoles to give up their names 
and autonomy. Last October, 
President Emmanuel Macron 
announced that the project 
would proceed with two 
clusters: one called Paris-Saclay 
University, the other called 
NewUni. 


| FUNDING 
US budget deal 


Nearly all US science agencies 
would see their budgets grow in 
2018, under a US$1.3-trillion 
deal that President Donald 
Trump signed into law on 

23 March. The legislation, 
which funds government 
operations up to 30 September, 
would boost funding for the 
National Institutes of Health 

to $37 billion, $3 billion above 
the 2017 level. The National 
Science Foundation would get 


SCIENCE ON SOCIAL MEDIA 


REN A 
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$7.8 billion, up $295 million, 
and NASAs budget would rise 
by $1.1 billion, to $20.7 billion. 
For the second year running, 
lawmakers in Congress ignored 
the steep cuts to science 
agencies sought by Trump. 

See page 572. 


Declining diversity 


Human activities are 
threatening the security of food 
and clean water by causing a 
rapid decline in the variety of 
plants and animals on Earth, 
according to United Nations- 
backed assessments released. 
on 23 March. Conducted by 
more than 500 experts across 
100 countries, the 3-year study 
looked at all areas of the world 
except the poles and the open 
ocean. Researchers found 

that in nearly every region, 
pollution, climate change and 
unsustainable exploitation 

of natural resources are 
endangering biodiversity. The 


An analysis of 30 science-related pages on Facebook found that almost 
one-third of the content covered new discoveries. 


All other types 
34% 


New discoveries 
29% 


6,582 


SCIENCE-RELATED 
FACEBOOK POSTS 


Advertisements or 
promotions 16% 


‘News you can use’ 
21% 
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assessments estimate that if 
the situation continues as it 

is, by 2050 there will be no 
exploitable fish stocks in the 
Asia-Pacific region, and the 
Americas will lose 40% of their 
biodiversity. 


een REG AUE Seeman 
Public-health chief 


HIV researcher Robert 
Redfield will lead the US 
Centers for Disease Control 
and Prevention (CDC), the 
secretary of health and human 
services, Alex Azar, announced 
on 21 March. Redfield “has 
dedicated his entire life to 
promoting public health and 
providing compassionate 

care to his patients’, Azar 

said in a statement. Redfield, 
co-director of the Institute 

of Human Virology at the 
University of Maryland in 
Baltimore, has studied HIV 
epidemiology, pathology and 
vaccines for over 30 years. He 
replaces Brenda Fitzgerald, 
who resigned on 31 January 
following news reports that she 
had traded stock in tobacco 
companies while leading 

the agency. See go.nature. 
com/2pbabvn for more. 


FACILITIES 


Sports brain bank 
Australia’s first bank for 
collecting the brains of 
sportspeople opened on 

27 March. The Australian 
Sports Brain Bank will 

enable research into chronic 
traumatic encephalopathy. 
The condition affects people 
with a history of repeated 
brain injury, such as those who 
have played contact sports, 
and military veterans. The 
bank, a joint initiative of the 
University of Sydney’s Brain 
and Mind Centre and the city’s 
Royal Prince Alfred Hospital, 
has so far received donation 
pledges from six Australian 
former sports professionals. 
It will also collect brains from 
people who haven't played 
sport professionally. 


> NATURE.COM 
For daily news updates see: 
www.nature.com/news 
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A volunteer in the Alzheimer’s study at the University of Antioquia, Colombia, who is unsure whether she carries a genetic mutation that causes the disease. 


Alzheimer’s study zeroes in 
on enigmatic protein 


Researchers in Colombia hope to uncover new drug targets. 


BY SARA REARDON, MEDELLIN, COLOMBIA 


hon Kennedy was building a house for 

his family when he realized that his 

45-year-old father was beginning to strug- 
gle with daily life. His father tried to help 
with the construction project but often forgot 
to complete simple tasks. And he kept getting 
lost on the way home from work. 

Jhon Kennedy wasn't surprised: his four 
uncles had also started to lose their memories, 
one by one. But their doctors in Colombia's 


rural Antioquia region, which is known for its 
mountainous terrain and coffee plantations, 
had never heard of early-onset dementia. It 
wasn‘ until a cousin learned about a study 
of Alzheimer’s disease at the University of 
Antioquia in Medellin that Jhon Kennedy’s 
relatives understood the illness they faced. For 
more than three decades, researchers there have 
been tracking a genetic mutation — common 
in the region — that causes Alzheimer’s to 
strike people in their 40s and 50s. 

Later this year, a team at the university will 


begin scanning the brains of some of the study 
participants using a technique that is available 
only ina few major medical centres worldwide. 
It will allow the researchers to track a protein 
called tau, which accumulates rapidly in the 
brains of people with Alzheimer’s as symp- 
toms begin to emerge. Watching tau form in 
real time could reveal the part it plays in the 
disease, says Francisco Lopera, the neurologist 
who is leading the research. 

Many scientists have long thought that the 
disease is triggered by another protein, 
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The University of Antioquia Brain Bank stores brains from people who had Alzheimer’s disease. 


> amyloid, that builds up in the brains of 
people with Alzheimer’s. But several drugs 
that reduce amyloid levels have failed to relieve 
the symptoms of the disease in clinical trials, 
increasing interest in the role of tau. 

If all goes well, Lopera’s team will soon be 
the first in Colombia with the ability to scan 
people’s brains for tau. In February, he and 
his colleagues published the results of a pilot 
study in which they brought 24 people from 
Colombia to Boston, Massachusetts, and 
used positron-emission tomography (PET) 
to search their brains for tau. The researchers 
showed for the first time that tau begins to 
accumulate in the brains of people with the 
Antioquia mutation six years before they begin 
to show signs of disease’. 

“This is a very definitive paper,’ says Bruce 
Miller, a behavioural neurologist at the 
University of California, San Francisco. “I 
think it’s another piece of evidence that tau is 
very important and has a strong correlation 
with clinical symptoms.” 

The genetic mutation that affected Jhon 
Kennedy’s father and uncles is well known 
in the field of Alzheimer’s research. It prob- 
ably arrived in South America with Spanish 
conquerors 375 years ago, and now affects 
25 extended families in Antioquia with some 
5,000 members. Researchers have published 
dozens of papers about this group, including 
some of the clearest evidence that amyloid 


plaques can accumulate in the brain decades 
before symptoms of Alzheimer’s appear’. 

But because questions have emerged about 
amyloid’s role in Alzheimer’s, scientists have 
taken a closer look at tau. The protein normally 
helps to stabilize the structures that enable 
neurons to communicate with one another. 
People with Alzheimer’s make too much dys- 
functional tau, causing these structures to col- 
lapse into tangles. The amount of tau seems 
to increase in step with a person’s symptoms. 

In the past several years, researchers have 
developed radioactive biomarkers that allow 
them to detect tau in living people using 
PET. Studies have shown that tau accumula- 
tion in the brain's language centres correlates 
with speech problems, for instance*. Several 
research groups are beginning clinical trials of 
drugs that scrub tau from the brain, although 
this work is at an early stage. 

Jhon Kennedy and his 11 siblings each have 
a 50% chance of inheriting the Alzheimer’s 
mutation from their father. Eight have enrolled 
in Lopera’s study and could be among the first 
people to undergo tau imaging in Colombia; 
none know whether they carry the mutation. 

The study, which began in 2013, was 
designed to test whether crenezumab, a drug 
that clears amyloid plaques from the brain, 
could lessen symptoms of Alzheimer’s. In the 
past five years, Lopera’s team has recruited 252 
participants who were between 30 and 60 years 


old. Those who carry the Alzheimer’s muta- 
tion will begin accumulating amyloid in their 
brains during their early 30s, on average. 

Each participant in the study will receive 
infusions of crenezumab or a placebo every 
other week, for five years. Lopera’s team is also 
testing their cognitive abilities, scanning their 
brains for amyloid and searching for blood 
proteins and other biomarkers that could be 
early indicators of disease. 

The scientists hope to begin mapping the tau 
in participants’ brains in the coming months, 
after they receive final permission from regula- 
tors to produce the crucial radioactive marker 
GPT1. When that happens, the team will join 
a handful of researchers worldwide that are 
using the technology. “It’s quite remarkable that 
a population that lives in such a remote area is 
getting access to some of the most advanced 
technology for understanding Alzheimer’s,” 
says Kenneth Kosik, a neuroscientist at the 
University of California, Santa Barbara. 

Lopera and his colleagues want to determine 
how tau spreads in the brains of young people 
with Alzheimer’s, and whether that pattern 
mirrors the distribution of tau seen in elderly 
people with the disease. They hope to compare 
their results with data from two clinical trials 
of anti-amyloid drugs in the United States that 
have begun scanning participants’ brains for 
tau. Lopera, Kosik and other scientists are also 
beginning to identify families in Colombia 
with different genetic mutations that cause 
dementia or neurological disorders linked to 
tau, in the hope of imaging their brains too. 

Lopera says that his team will not release 
data from the crenezumab study until it 
finishes in 2022. But if the results are prom- 
ising, he adds, the researchers might give the 
drug to people under 30 who carry the muta- 
tion for early-onset Alzheimer’s. The scientists 
have identified almost 500 young people who 
might be carriers, including Jhon Kennedy’s 
15-year-old daughter. “She’s not worried about 
it, Jhon Kennedy says. “She’s like me — she 
lives day by day:’ = 
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MATERIALS SCIENCE 


Composite printer paves 
way for custom robots 


Four-in-one device mixes materials with speed and precision. 


BY MARK ZASTROW 


prototype 3D printer has for the 
At time combined several printing 

methods to enable researchers to 
produce devices from multiple materials in 
a single print run. So far, the machine has 
created basic electronic devices, but the tech- 
nology brings materials scientists a step closer 
to their goal of printing complex equipment 
such as robots or smartphones. 

The printer was presented at a meeting 
of the American Chemical Society in New 
Orleans, Louisiana, on 21 March. 

“This is aremarkable technological advance 
and a great leap for the field of 3D printing,” 
says Xuanhe Zhao, a materials scientist at the 
Massachusetts Institute of Technology in Cam- 
bridge, who was not involved in the work. 

The most common 3D printers heat a plas- 
tic filament and lay it down in repeated lines, 
building a layered structure from the bottom 
up. This is the technique used in inexpensive 
consumer models. Several other 3D-printing 
methods have also emerged in recent years; 
these include spraying fine streams of aerosols, 
printing with liquid resin that is then cured to 


form a flexible polymer, laying down thin lay- 
ers of ink that are dried and hardened when 
exposed to light, and even printing with ink 
that contains conductive nanoparticles, to pro- 
duce wires and circuits. 

“Each printing technology has its own 
limitations,” says Jerry Qi, a materials scien- 
tist at the Georgia Institute of Technology 
in Atlanta, who led the design of the multi- 
material printer. “We 


put four 3D-printing “Thisis a 
technologies under remarkable 
one platform” technological 
Although cur- advanceanda 
rent 3D printerscan great leap for 


already produce 
electronic parts and 
devices made of multiple materials, if a struc- 
ture requires more than one printing method 
a different machine is required for each. But 
moving an object from one printer to another 
is usually impractical for the micrometre-level 
precision that is required in 3D printing, and 
is inefficient if several materials are used in a 
single layer, says Qi. 

His team’s multimaterial printer has 
printheads — the nozzles that produce the 


the field.” 
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material — for each of the four techniques on 
a single printing platform. Each has its own 
software, lights for curing the materials, and 
a moving platform and robotic arms that can 
pick up and place components. This allows the 
printheads to work together to build single lay- 
ers with a range of materials. “It is a very smart 
solution to this challenge,” says Zhou. 

Qi and his team have used the printer to 
embed a light-emitting diode inside a plastic 
case, printing the inner circuitry at the same 
time as the outer enclosure. They have also 
printed a layer of conductive ink inside a 
rubbery material that can stretch while deliv- 
ering a current, demonstrating its potential for 
flexible electronics. 

The printer has excited researchers in 
the field. “We just had a conversation where 
we were dreaming of that kind of machine,’ 
says Geoff Spinks, a materials engineer at the 
University of Wollongong in Australia. Such 
stretchable electronics could be used in com- 
pact soft robots, providing flexibility that is 
impossible using current methods of embed- 
ding wires into a device, he says. 

The machine was funded mostly by the 
US Air Force and cost roughly US$350,000 to 
build, says Qi. He thinks the printer’s first cus- 
tomers will be aerospace companies that will 
use it to design circuitry for avionics, where 
printing antennas directly could enable rapid 
prototyping of avionic devices. Qi estimates 
the printer would sell for about $1 million. 

Spinks says this is only the beginning for 
multimaterial printers. He expects devices 
with expanded capabilities will soon become 
available: “I'd imagine in the near future we 
might have 16 different types of printheads.” m 


ENVIRONMENT 


Forecasting efforts target 
harmful plankton blooms 


Ocean scientists seek automated system to predict the growth of tiny marine denizens. 


BY JEFF TOLLEFSON 


assive blooms of a marine organism 
Me Noctiluca scintillans are cur- 

rently threatening fisheries, tour- 
ism and desalination plants in the Arabian 
Sea. But scientists are developing forecast- 
ing models that could help Oman and other 
countries in the region to predict when con- 
ditions seem ripe for a boom in these single- 
celled creatures, so that they can be better 
prepared. 


The goal is to produce a mostly automated 
system that works like a weather forecast. It 
would ingest data on atmospheric and ocean 
conditions, and kick out regular seven-day 
bloom forecasts — which would be available 
to businesses, governments and scientists. 

Such a system could also help researchers 
to understand the spread of plankton blooms 
in the Arabian Sea and beyond in the dec- 
ades to come, says Joaquim Goes, a biologi- 
cal oceanographer at the Lamont-Doherty 
Earth Observatory in Palisades, New York, 


and leader of the modelling project. The 
blooms might provide a short-term boost to 
some commercial fisheries, but marine bio- 
diversity could ultimately decline if Noctiluca 
continues to grow and dominate the ecosys- 
tem, he says. 

The project is one of several efforts around 
the world to develop models that can be 
used to understand and forecast harmful 
algal blooms. As early as this year, the US 
National Oceanic and Atmospheric Admin- 
istration (NOAA) expects to launch its first 
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operational forecast for toxic 
plankton blooms in the Gulf of 
Maine using a similarly advanced 
model, says Richard Stumpf, a 
NOAA oceanographer in Silver 
Spring, Maryland. 

Simulating, and then predicting, 
such blooms is difficult, Stumpf 
says, because researchers must 
understand not just local water 
chemistry and ocean circulation 
but also the behaviour of different 
plankton species. 


BOOM AND BUST 

A huge bloom in the Arabian Sea 
that began last November is only 
now winding down. At its peak in 
January, it covered an area three 
times the size of Texas. “Noctiluca 
is completely overwhelming the 
system,’ says Goes. 

Goes and his team spent five weeks in Oman 
in January and February this year collecting 
samples and talking to local fishermen, as well 
as industry and government officials. They 
discussed the forecasting system and the kind 
of bloom-prediction information community 
members would need. 

The team’s current forecasting model 
is promising, and the researchers hope to 
improve its resolution and to incorporate 
information on Noctiluca’s tricky biology. 

This minute organism has both plant 
and animal characteristics. It is a predator 
that thrives in the low-oxygen waters that 


Noctiluca blooms (green s 


have become more common in the region 
(H.R. Gomes et al. Nature Commun. 5, 4862; 
2014). But Noctiluca also has a symbiotic 
relationship with algae that live inside the 
cells. These algae contain the green pigment 
chlorophyll, which sensors on satellites can 
monitor from space, enabling researchers to 
track the blooms. 


A GROWING PROBLEM 

It's unclear why major Noctiluca blooms now 
occur in oceans around the world, including 
off the coasts of India, Thailand and Indo- 
nesia. Researchers suspect that the release 


wirls) in the Arabian Sea, as seen from space. 


of raw sewage, which provides 
nutrients, and global warming 
might have roles. 

The right conditions can stimu- 
late Noctiluca blooms so thick that 
when they die, the decomposing 
bodies suck most of the oxygen 
out of the seawater. In years past, 
the bloom’s expanding low-oxygen 
zone drove sardines onto some 
beaches in Oman; aquaculture 
farms have also been affected. 

This year, desalination plants 
had to scale back their activities 
to prevent Noctiluca from physi- 
cally clogging their systems. 

The current model devised 
by Goes and his team for the 
Indian Ocean has a resolution of 
3.5 kilometres. It’s able to forecast 
the bloom potential of two types 
each of plant and animal plankton. 
The model reproduces the large-scale behaviour 
of the current bloom in the Arabian Sea fairly 
well, but it’s not yet detailed enough to produce 
a useful forecast for a local shrimp farm ora 
desalination plant, says Sergio deRada, an ocean 
modeller at the Naval Research Laboratory at 
the Stennis Space Center in Mississippi. 

The next step is to boost resolution to 
1 kilometre or even 500 metres, and to repre- 
sent Noctiluca itself in the model. The prob- 
lem, says deRada, is that Noctiluca’s behaviour 
is more complex than that of the plankton 
currently included in the model. “That’s why 
modelling it is going to be tough.” m 
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Reduced- calorie diet shows 
signs of slowing ageing 


Restricting food intake dials down people’s metabolism in clinical trial. 


BY ALISON ABBOTT 


study of people who reduced the 
Az they consumed has found 
the strongest evidence yet that such 
restrictions can slow down human metabo- 
lism. The results raise hopes that a low-cal- 
orie lifestyle — or treatments that mimic the 
biological effects of restricted eating — could 
prolong health in old age, and even extend life. 
Past work in many short-lived animals, 
including worms, flies and mice, has shown 
that calorie restrictions reduce metabolism 
and extend lifespan. But experiments in 


longer-living humans and other primates 
are more difficult to conduct and have not 
yet drawn clear conclusions. 

The study was part of the multi-centre 
trial called CALERIE (Comprehensive 
Assessment of Long term Effects of Reduc- 
ing Intake of Energy), sponsored by the US 
National Institutes of Health. The rand- 
omized, controlled trial tested the effects of 
2 years of caloric restriction on metabolism 
in more than 200 healthy, non-obese adults. 

“The CALERIE trial has been important 
in addressing the question of whether the 
pace of ageing can be altered in humans,” 
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says Rozalyn Anderson, who studies ageing 
at the University of Wisconsin-Madison. 
She leads one of two large, independent 
studies on calorie restriction in rhesus mon- 
keys. “This new report provides the most 
robust evidence to date that everything we 
have learnt in other animals can be applied 
to ourselves.” 

Published on 22 March in Cell Metabolism, 
the latest study looked at 53 CALERIE par- 
ticipants who had been recruited at the 
Pennington Biomedical Research Center in 
Baton Rouge, Louisiana (L. M. Redman et al. 
Cell Metab. http://doi.org/cmrx; 2018). This 


CARLOS SANCHEZ 


Reduced food intake has affected people’s metabolism in a controlled study. 


facility is home to 4 of the world’s 20 or 
so state-of-the-art metabolic chambers, 
which are like small, sealed hotel rooms that 
measure minute-by-minute the amount of 
oxygen that occupants use and how much 
carbon dioxide they exhale. This allows 
researchers to track how the occupants use 
energy with unprecedented precision, says 
Anderson. The ratio between the two gases, 
combined with analysis of nitrogen in occu- 
pants’ urine, indicates whether the occupant 
is burning fat, carbohydrate or protein. 
The trial participants, aged between 
21 and 50, were randomized into two 
groups: 34 reduced their calorie intake by 
an average of 15%, and 19 ate as usual. At 
the end of each of the two years, they all 
underwent tests related to metabolism and 
biological markers of ageing. They were also 
placed in a metabolic chamber for 24 hours. 
The scientists found that participants on 
the diet used energy much more efficiently 
while sleeping than did the control group. 
This reduction in their base metabolic rate 
was greater than would be expected as a 
result of the test group’s weight loss, which 
averaged nearly 9 kilograms per participant. 
All the other clinical measurements were in 
line with reduced metabolic rate, and indi- 
cated a decrease in damage due to ageing. 


MODEL METABOLISM 

Caloric restriction has been known for 
decades to extend life in different species. 
In the 1990s, scientists began to identify the 
genes and biochemical pathways that are 
actively involved in longevity in the short- 
lived worm Caenorhabditis elegans, and 
in the fly Drosophila melanogaster. These 
include pathways relevant to insulin sensitiv- 
ity and the function of mitochondria — cel- 
lular structures that use oxygen to generate 
energy. Later studies revealed that calorie 
restrictions alter similar pathways in mice 
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and monkeys. Mice on restricted diets can 
live up to 65% longer than mice allowed to 
eat freely, and the ongoing monkey studies 
hint at longer survival and reduced signs of 
ageing. 

“The Rolls-Royce of a human longevity 
study would carry on for many decades to 
see if people do actually live longer,’ says 
Pennington physiologist Leanne Redman, 
the lead author of the latest study. CALERIE 
ran for two years, and aimed to see whether 
a calorie-restricted diet in humans induces 
some of the same metabolic, hormonal 
and gene-expression adaptations that are 
thought to be involved in slowing ageing 
in other species during long-term caloric 
restriction. 

Few people would want, or be able, to 
restrict their diet as severely as the partici- 
pants in the study. “But understanding the 
biology of how restricting calories extends 
life will allow us to find easier ways to 
intervene,’ says Anderson. 

Redman would like to repeat the study, 
combining less-ambitious calorie restriction 
with a diet containing antioxidant food to 
reduce damage from oxygen free radicals, or 
with a drug such as resveratrol, which mim- 
ics key aspects of calorie restriction. 

Other scientists are starting to test 
restricting calories for a few days every 
month. Such intermittent limitations have 
been found to be as effective as continu- 
ous calorie restriction in protecting mice 
against diseases of ageing such as diabetes 
and neurodegeneration (V. D. Longo and 
M. P. Mattson Cell Metab. 19, 181-192; 
2014). “I think that’s going to be a way to 
get all the benefits, without the problems of 
constant dieting,” says gerontologist Valter 
Longo of the University of Southern Cali- 
fornia in Los Angeles, who is embarking on 
clinical trials of intermittent calorie restric- 
tion in various disorders. m 


ti 
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Green light for 
exoplanet probe 


Europe’s ARIEL mission will 
study planetary atmospheres. 


BY ELIZABETH GIBNEY 


he world’s first space telescope 

| dedicated to studying the atmospheres of 

exoplanets is set to launch in 2028, after 

the European Space Agency (ESA) selected the 
mission for development on 20 March. 

The Atmospheric Remote-sensing Infrared 
Exoplanet Large-survey (ARIEL) — to be built 
with a budget of €450 million (US$558 million) 
— will chart more than 1,000 known planetary 
systems outside our own. Its mission is to under- 
stand the links between a planet’s chemistry 
and its environment, says Giovanna Tinetti, a 
planetary scientist at University College London 
and principal investigator for the mission. So 
far, exoplanet probes have mainly focused on 
finding new planets and measuring their sizes, 
masses and orbits. “ARIEL can really give us a full 
picture of what exoplanets are made of, how they 
form and how they evolve,’ says Tinetti. 

A spectrograph aboard the observatory will 
study the light that filters through a planet's 
atmosphere as it passes across the face of its host 
star, revealing chemical fingerprints of gases 
that shroud the body. The mission will also try 
to refine estimates of a planet’s temperature 
by teasing out how light from its star changes 
when the body moves behind it, revealing 
details about a planet’s overall radiation. Tinetti 
says that it will monitor the worlds at snapshots 
in time, and also measure how the chemistry 
and atmospheres of planets vary across their 
surfaces and with the seasons. 

Since the first exoplanet discoveries in the 
1990s, some 3,700 worlds have been confirmed 
in 2,800 systems. ARIEL will focus on planets 
that have an estimated temperature of more 
than 350°C — and are thus unlikely to host life 
as we know it — and a range of masses, in par- 
ticular those heavier than a few Earth masses. 
The observations will build a ‘standard model’ 
of howa planet's chemistry depends on its star 
and the conditions of its birth, Tinetti says. 

ARIEL is one of several exoplanet missions 
in the works around the world, but its focus on 
planetary atmospheres is rare. Other missions 
are studying planet size, density and mass. 
Together, this generation of telescopes will 
allow scientists to test models of planetary for- 
mation and composition, says Elizabeth Tasker, 
an exoplanet researcher at the Japan Aerospace 
Exploration Agency’s Institute of Space and 
Astronautical Science in Sagamihara. m 
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US science agencies reap 
budget windfall 


Government spending on research and development rises to a historic high in 2018. 


BY LAUREN MORELLO AND 
GIORGIA GUGLIELMI 


early all US science agencies will see 

| \ | their budgets grow in 2018, under a 

US$1.3-trillion spending deal enacted 

on 23 March. For the second year in a row, 

lawmakers in Congress ignored the steep cuts 
sought by President Donald Trump. 

The legislation boosts funding for the 
National Institutes of Health (NIH) to 
$37 billion, $3 billion over the 2017 level. The 
National Science Foundation receives $7.8 bil- 
lion, $295 million more than it got last year. 
And NASAs budget rises to $20.7 billion, an 
increase of $1.1 billion. 

One notable outlier to the overall trend is 
the Environmental Protection Agency, whose 
funding will stay frozen at the 2017 level of 
$8.1 billion. But that might be a victory of 
sorts, given that Trump had proposed slash- 
ing the agency’s budget by more than 30% in 
2018, to $5.7 billion. 

“This is a good deal,’ says Benjamin Corb, 
director of public affairs at the American Soci- 
ety for Biochemistry and Molecular Biology in 
Rockville, Maryland. “We're happy to see these 
numbers” (see ‘Budget bonanza’). The budget 
legislation lifts overall US government spend- 
ing on research and development to a historic 
high in 2018, according to an analysis by the 
American Association for the Advancement 
of Science in Washington DC. 

The deal includes increases for several of the 
NIH’ signature programmes. The BRAIN ini- 
tiative (Brain Research through Application of 
Innovative Neurotechnologies) receives $400 
million, up $140 million from 2017. And the 
All of Us research programme — a 10-year 
effort to track the health of 1 million Ameri- 
cans, which was formerly known as the Preci- 
sion Medicine Initiative — gets $290 million. 
That is a $60-million boost over last year. 


GUN DEBATE 

The Centers for Disease Control and Preven- 
tion (CDC) receives $8.3 billion, an increase of 
$1.1 billion over the 2017 level. Lawmakers in 
Congress reportedly debated whether to aban- 
don a long-standing provision that prohibits 
the CDC from advocating or promoting gun 
control, which has been interpreted widely as 
a ban on gun research funded or conducted 


BUDGET BONANZA 


The US Congress negotiated a 2018 spending deal that ignores the steep cuts to science funding sought 


by President Donald Trump. 
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by the CDC. But the spending bill stops short 
of repealing the provision, known as the 
Dickey Amendment. 

Instead, an official report accompanying 
the legislation notes that, although it includes 
language to prohibit the CDC and other agen- 
cies from funding activities to advocate or pro- 
mote gun control, “the Secretary of Health and 
Human Services has stated the CDC has the 
authority to conduct research on the causes of 
gun violence”. 

That “does nothing’, says David Hemenway, 
an economist who directs the Harvard Injury 

Control Research 


“This is a good Center at Harvard 
deal. We’re University in Bos- 
happy to see ton, Massachusetts. 


thesenumbers.” “Everybody knows 
that the Dickey 
Amendment does not prohibit federal funding 
of gun research.” But officials at the CDC also 
know that if they fund gun research, “they're 
going to get beaten up”. 

The budget deal sets aside $6.2 billion for 
NASA’ science directorate, an increase of 
$457 million from the 2017 level. That includes 
$595 million to send an orbiter and lander to 
Jupiter's moon Europa; Trump had sought $425 
million to send a spacecraft flying past that 
moon. The legislation also includes funding 
for four Earth-science programmes that Trump 
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had sought to cut, including the Orbiting 
Carbon Observatory-3, which would measure 
atmospheric carbon dioxide from space. 

Lawmakers also rejected Trump’ plan to can- 
cel the Wide-Field Infrared Survey Telescope 
(WFIRST), which would hunt exoplanets and 
dark matter — a project that US astronomers 
ranked as their top priority in a 2010 survey of 
science priorities for the next decade. 

The funding deal includes $6.2 billion for 
the Department of Energy’s Office of Science, 
an $868-million jump from the 2017 level. 
Trump had sought to cut its budget to just 
under $4.5 billion. Congress also rejected the 
president’s proposal to slash funding for the 
Advanced Research Projects Agency — Energy 
(ARPA-E), which pursues risky research that 
could yield major advances. The spending 
legislation includes $353 million for ARPA-E; 
Trump had proposed $20 million. 

Despite the windfall for science agencies, 
some budget experts warn that science advo- 
cates should not become complacent. “Don't 
look for a repeat of this next year,’ says Michael 
Lubell, a physicist at the City College of New 
York. The large increases for science agencies 
came after Congress moved in February to 
relax mandatory spending caps for 2018 and 
2019. That added $63 billion to the overall 
funding pot this year, but the number will grow 
only slightly in 2019, to $68 billion. m 


SOURCE: AAAS 
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Ancient genomes are revolutionizing the study of human 
prehistory but sometimes straining the relationships 
between archaeologists and geneticists. 


BY EWEN CALLAWAY 


hirty kilometres north of Stonehenge, 
through the rolling countryside of south- 
west England, stands a less-famous 
window into Neolithic Britain. Established 
around 3600 Bc by early farming communi- 
ties, the West Kennet long barrow is an earthen 
mound with five chambers, adorned with giant 
stone slabs. At first, it served as a tomb for some 
three dozen men, women and children. But peo- 
ple continued to visit for more than 1,000 years, 
filling the chambers with relics such as pottery 
and beads that have been interpreted as tributes 
to ancestors or gods. 
The artefacts offer a view of those visitors 
and their relationship with the wider world. 
Changes in pottery styles there sometimes 


echoed distant trends in continental Europe, 
such as the appearance of bell-shaped beakers 
—aconnection that signals the arrival of new 
ideas and people in Britain. But many archaeol- 
ogists think these material shifts meshed into a 
generally stable culture that continued to follow 
its traditions for centuries. 

“The ways in which people are doing things 
are the same. They're just using different mate- 
rial culture — different pots,’ says Neil Carlin at 
University College Dublin, who studies Ireland 
and Britain's transition from the Neolithic into 
the Copper and Bronze Ages. 

But last year, reports started circulat- 
ing that seemed to challenge this picture of 
stability. A study’ analysing genome-wide 
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The West Kennet long 
barrow served asa 
tomb and ceremonial 
site for more than a 
millennium. 
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data from 170 ancient Europeans, including 
100 associated with Bell Beaker-style artefacts, 
suggested that the people who had built the 
barrow and buried their dead there had all but 
vanished by 2000 sc. The genetic ancestry of 
Neolithic Britons, according to the study, was 
almost entirely displaced. Yet somehow the new 
arrivals carried on with many of the Britons’ tra- 
ditions. “That didn't fit for me, says Carlin, who 
has been struggling to reconcile his research 
with the DNA findings. 

The Bell Beaker ‘bombshell’ study appeared 
in Nature’ in February and included 230 more 
samples, to make it the largest ancient-genome 
study on record. But it is just the latest exam- 
ple of the disruptive influence that genetics has 
had on the study of the human past. Since 2010, 
when the first ancient-human genome was fully 
sequenced’, researchers have amassed data 
on more than 1,300 individuals (see ‘Ancient 
genomes’), and used them to chart the emer- 
gence of agriculture, the spread of languages 
and the disappearance of pottery styles — 
topics that archaeologists have laboured over 
for decades. 

Some archaeologists are ecstatic over the 
possibilities offered by the new technology. 
Ancient-DNA work has breathed new life and 
excitement into their work, and they are begin- 
ning once-inconceivable investigations, such 
as sequencing the genome of every individual 
from a single graveyard. But others are cautious. 

“Half the archaeologists think ancient DNA 
can solve everything. The other half think 
ancient DNA is the devil's work,” quips Philipp 
Stockhammer, a researcher at Ludwig-Maximil- 
ians University in Munich, Germany, 
who works closely with geneticists 
and molecular biologists at an insti- 
tute in Germany that was set up a few 
years ago to build bridges between the 
disciplines. The technology is no silver 
bullet, he says, but archaeologists ignore it 
at their peril. 

Some archaeologists, however, worry that 
the molecular approach has robbed the field 
of nuance. They are concerned by sweeping 
DNA studies that they say make unwar- 
ranted, and even dangerous, assumptions 
about links between biology and culture. 
“They give the impression that they’ve 
sorted it out,’ says Marc Vander Linden, 
an archaeologist at the University of 
Cambridge, UK. “That’s a little bit irri- 
tating” 

This isrtt the first time archaeologists 
have had to contend with transformative 
technology. “The study of prehistory today 
is in crisis,’ wrote Cambridge archaeolo- 
gist Colin Renfrew in his 1973 book Before 
Civilization, describing the impact of radio- 
carbon dating. Before the technique was devel- 
oped by chemists and physicists in the 1940s 
and 50s, prehistorians determined the age of 
sites using ‘relative chronologies, in some cases 
relying on ancient Egyptian calendars and false 


assumptions about the spread of ideas from the 
Near East. “Much of prehistory, as written in 
the existing textbooks is inadequate: some of it, 
quite simply wrong,” Renfrew surmised. 

It wasn't an easy changeover — early carbon- 
dating efforts were off by hundreds of years or 
more — but the technique eventually allowed 
archaeologists to stop spending most of their 
time worrying about the age of bones and arte- 
facts and focus instead on what the remains 
meant, argues Kristian Kristiansen, who studies 
the Bronze Age at the University of Gothenburg 
in Sweden. “Suddenly there was a lot of free 
intellectual time to start thinking about pre- 
historic societies and how they are organized.” 
Ancient DNA now offers the same opportunity, 
says Kristiansen, who has become one of his 
field’s biggest cheerleaders for the technology. 

Genetics and archaeology have been uneasy 


“THESE 
RESULTS WERE 
ASHOCK T0 THE 

ARCHAEOLOGICAL 
COMMUNITY.” 


Bell Beaker pots signal a period of unprecedented 
cultural intermingling for early Europeans. 
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bedfellows for more than 30 years — the first 
ancient-human DNA paper’, in 1985, reported 
sequences from an Egyptian mummy (now 
thought to be contamination). But improve- 
ments in sequencing technology in the mid- 
to-late 2000s set the fields on a collision course. 
In 2010, scientists led by Eske Willerslev at 
the Natural History Museum of Denmark used 
DNA froma lock of hair from a 4,000-year-old 
native Greenlander to generate the first com- 
plete sequence of an ancient-human genome’. 
Seeing the future of the field before his eyes, 
Kristiansen asked Willerslev to team up ona 
prestigious European Research Council grant 
that would allow them to examine human 
mobility as the late Neolithic gave way to the 
Bronze Age, some 4,000-5,000 years ago. 


ASSOCIATION PROBLEMS 

Migration has been a major source of tension 
for archaeologists. They have debated at length 
whether human movements are responsible 
for cultural changes in the archaeological 
record, such as the Bell Beaker phenomenon, 
or whether it is simply the ideas that are mov- 
ing through cultural exchanges. Populations 
identified by the artefacts they associated with 
came to be seen as a remnant of the science’s 
colonial past, and one that imposed artificial 
categories. “Pots are pots, not people,’ goes a 
common refrain. 

Most archaeologists have since cast aside 
the view that prehistory was like a game of 
Risk, in which homogenous cultural groups 
conquer their way across a map of the world. 
Instead, researchers tend to focus on under- 

standing a small number of ancient 

sites and the lives of the people who 
lived there. “Archaeology had moved 
away from these grand narratives,” 
says Tom Booth, a bioarchaeologist at 
the Natural History Museum in London, 
who is part of a team using ancient DNA to 
trace the arrival of farming in Britain. “A lot 
of people thought you needed to understand 
change regionally to understand people’ lives.” 
Ancient-DNA work — which has repeat- 
edly shown that a region's modern inhabitants 
are often distinct from populations that lived 
there in the past — promised, for better or 
worse, to bring back some of the broad focus 
on migration to human prehistory. “What 
genetics is particularly good at is detect- 
ing change in populations,’ says David 
Reich, a population geneticist at Harvard 
Medical School in Boston, Massachusetts. 
Archaeologists, Kristiansen says, “were 
prepared to accept that individuals had 
travelled”. But for the Bronze Age period that 
he studies, “they were not prepared for major 
migrations. That was a new thing.” 
Studies of strontium isotopes in teeth’, which 
vary with local geochemistry, had hinted that 
some Bronze Age individuals had moved 
hundreds of kilometres over their lifetimes, 
Kristiansen says. He and Willerslev wondered 
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whether DNA analysis might detect movements 
of whole populations during this period. 

They would have competition. In 2012, 
David Anthony, an archaeologist at Hartwick 
College in Oneonta, New York, loaded his car 
with boxes of human remains that he and his 
colleagues had excavated from the steppes near 
the Russian city of Samara, including bones 
associated with a Bronze Age pastoralist culture 
called the Yamnaya. He was bringing them to 
the ancient-DNA lab just established by Reich 
in Boston. Like Kristiansen, Anthony was com- 
fortable theorizing about the past on a grand 
scale. His 2007 book The Horse, the Wheel and 
Language proposed that the Eurasian steppe 
had been a melting pot for the modern devel- 
opments of horse domestication and wheeled 
transport, which propelled the spread of a fam- 
ily of languages called Indo-European across 
Europe and parts of Asia. 

In duelling 2015 Nature papers*”, the teams 
arrived at broadly similar conclusions: an influx 
of herders from the grassland steppes of present- 
day Russia and Ukraine — linked to Yamnaya 
cultural artefacts and practices such as pit burial 
mounds — had replaced much of the gene pool 
of central and Western Europe around 4,500- 
5,000 years ago. This was coincident with the 
disappearance of Neolithic pottery, burial styles 
and other cultural expressions and the emer- 
gence of Corded Ware cultural artefacts, which 
are distributed throughout northern and cen- 
tral Europe. “These results were a shock to the 
archaeological community,’ Kristiansen says. 


CORD CUTTERS 

The conclusions immediately met with 
push-back. Some of it began even before the 
papers were published, says Reich. When he 
circulated a draft among his dozens of collabo- 
rators, several archaeologists quit the project. 
To many, the idea that people linked to Corded 
Ware had replaced Neolithic groups in Western 
Europe was eerily reminiscent of the ideas of 
Gustaf Kossinna, the early-twentieth-century 
German archaeologist who had connected 
Corded Ware culture to the people of modern 
Germany and promoted a ‘Risk board’ view of 
prehistory known as settlement archaeology. 
The idea later fed into Nazi ideology. 

Reich won his co-authors back by explicitly 
rejecting Kossinna’s ideas in an essay included in 
the paper’s 141-page supplementary material’. 
He says the episode was eye-opening in showing 
how a wider audience would perceive genetic 
studies claiming large-scale ancient migrations. 

Still, not everyone was satisfied. In an essay® 
titled ‘Kossinna’s Smile’ archaeologist Volker 
Heydat the University of Bristol, UK, disagreed, 
not with the conclusion that people moved west 
from the steppe, but with how their genetic 
signatures were conflated with complex cul- 
tural expressions. Corded Ware and Yamnaya 
burials are more different than they are similar, 
and there is evidence of cultural exchange, at 
least, between the Russian steppe and regions 
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GENOMES 


More than 1,300 genome 
sequences have been produced 
from the remains of ancient 
humans, sometimes challenging 
the historical narratives derived 
from artefacts. 


Europe 
North 
America 
South 
America 
Oceania 


Afri 
ee from Africa this year. 


Asia 


NCC0C028 HS NR C888 


2013 2014 2015 


west that predate Yamnaya culture, he says. 
None of these facts negates the conclusions of 
the genetics papers, but they underscore the 
insufficiency of the articles in addressing the 
questions that archaeologists are interested in, 
he argued. “While I have no doubt they are basi- 
cally right, it is the complexity of the past that is 
not reflected” Heyd wrote, before issuing a call 
to arms. “Instead of letting geneticists determine 
the agenda and set the message, we should teach 
them about complexity in past human actions” 

Ann Horsburgh, a molecular anthropolo- 
gist and prehistorian at Southern Methodist 
University in Dallas, Texas, attributes such ten- 
sions to communication problems. Archaeol- 
ogy and genetics say distinct things about the 
past, but often use similar terms, such as the 
name of a material culture. “Its C. P. Snow all 
over again,” she says, referring to the influential 
“Two Cultures’ lectures by the British scientist 
lamenting the deep intellectual divide between 
the sciences and the humanities. Horsburgh 
complains that genetic results are too often 
given precedence over inferences about the 
past from archaeology and anthropology, and 


In the largest ancient-DNA study to 
date, scientists sequenced the remains 
of 400 Neolithic, Copper-Age and 
Bronze-Age Europeans, including 226 
associated with Bell Beaker artefacts. 


A 4,500-year-old skeleton from a cave in 
Ethiopia provided the first ancient-human 
genome sequence from Africa, providing 
context on movements back from Eurasia. 
Scientists expect many more examples 
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that such “molecular chauvinism” prevents 
meaningful engagement’. “It’s as though genetic 
data, because they’re generated by people in lab 
coats, have some sort of unalloyed truth about 
the Universe.” 

Horsburgh, who is seeing her own field of 
African prehistory start to feel the tremors 
of ancient genomics, says that archaeologists 
frustrated at having their work misinterpreted 
should wield their power over archaeological 
remains to demand more equitable partnerships 
with geneticists. “Collaboration doesn't mean I 
send you an e-mail saying ‘hey, you've got some 
really cool bones. I'll get you a Nature paper: 
That’s not a collaboration,’ she says. 

Many archaeologists are also trying to 
understand and engage with the inconvenient 
findings from genetics. Carlin, for instance, says 
that the Bell Beaker genome study sent him on 
“ajourney of reflection” in which he questioned 
his own views about the past. He has pored over 
the selection of DNA samples included in the 
study as well as the basis for its conclusion that 
the appearance of Bell Beaker artefacts coin- 
cided with a greater than 90% replacement in 
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Britain's gene pool. “I didn't want to be question- 
ing it from a position of ignorance,’ Carlin says. 

Like Heyd, he accepts that a shift in ancestry 
occurred (although he has questions about its 
timing and scale). Those results, in fact, now 
have him wondering about how cultural prac- 
tices such as leaving pottery and other tributes 
at the West Kennet long barrow persisted in the 
face of such upheavals. “I would characterize a 
lot of these papers as ‘map and describe. They're 
looking at the movement of genetic signatures, 
but in terms of how or why that’s happening, 
those things aren't being explored,’ says Carlin, 
whois no longer disturbed by the disconnect. “T 
am increasingly reconciling myself to the view 
that archaeology and ancient DNA are telling 
different stories.” The changes in cultural and 
social practices that he studies might coincide 
with the population shifts that Reich and his 
team are uncovering, but they don't necessarily 
have to. And such biological insights will never 
fully explain the human experiences captured 
in the archaeological record. 

Reich agrees that his field is in a “map-mak- 
ing phase’, and that genetics is only sketching 
out the rough contours of the past. Sweeping 
conclusions, such as those put forth in the 
2015 steppe migration papers, will give way to 
regionally focused studies with more subtlety. 

This is already starting to happen. Although 
the Bell Beaker study found a profound shift 
in the genetic make-up of Britain, it rejected 
the notion that the cultural phenomenon was 
associated with a single population. In Iberia, 
individuals buried with Bell Beaker goods were 
closely related to earlier local populations and 
shared little ancestry with Beaker-associated 
individuals from northern Europe (who were 
related to steppe groups such as the Yamnaya). 
The pots did the moving, not the people. 

Reich describes his role as that of a ‘mid- 
wife’ delivering ancient-DNA technology to 
archaeologists, who can apply it as they see 
fit. “Archaeologists will embrace this technol- 
ogy and will not be Luddites, he predicts, “and 
they'll make it their own” 


A STRONGER PARTNERSHIP 

Nestled in a sleepy valley in the state of 
Thuringia in former East Germany, the city of 
Jena has become an unlikely hub for the con- 
vergence of archaeology and genetics. In 2014, 
the prestigious Max Planck Society established 
an Institute for the Science of Human History 
there and installed a rising star in ancient- 
DNA research, Johannes Krause, as a director. 
Krause was a protégé of the geneticist Svante 
Paadbo, at the Max Planck Institute for Evo- 
lutionary Anthropology in Leipzig. There, 
Krause worked on the Neanderthal genome” 
and helped discover a new archaic human 
group, known as Denisovans"’. 

Whereas Paabo was focused on applying 
genetics to biological questions about ancient 
humans and their relatives, Krause saw a wider 
scope for the technology. Before leading the Jena 


institute, his team identified DNA from plague- 
causing bacteria in the teeth of people who died 
from the Black Death in the fourteenth century, 
the first direct evidence of a potential cause for 
the pandemic”. At Jena, Krause hoped to bring 
genetics to bear, not just on ‘prehistorical’ peri- 
ods such as the Neolithic and the Bronze Age, 
where archaeological methods are the main tool 
for reconstructing the past, but also on more- 
recent times. Outreach with historians is still a 
work in progress, but archaeology and genet- 
ics are thoroughly embedded at the institute. 
The department Krause directs is even called 
archaeogenetics. “We have to be interdiscipli- 
nary,’ he says, because geneticists are addressing 


"ARCHAEOLOGISTS 
WILL EMBRACE 
THIS TECHNOLOGY 
AND WILL NOT BE 
LUDDITES." 


questions and time periods that archaeologists, 
linguists and historians have been poring over 
for decades. 

Krause and his team have been heavily 
involved in the map-making phase of ancient 
genomics (he worked closely with Reich's team 
on many such projects). But a study published 
late last year’ that focused on the transition 
between the Neolithic and Bronze Age in 
Germany won plaudits from archaeologists who 
have been dubious of the larger-scale ancient- 
DNA studies. 

Led by Stockhammer, who also has a post at 
the Jena institute, the team analysed 84 Neo- 
lithic and Bronze Age skeletons from southern 
Bavaria’s Lech River Valley dating to between 
2500 and 1700 Bc. The diversity in the genomes 
of cellular structures known as mitochondria, 
which are inherited maternally, rose during this 
period, suggesting an influx of women. Mean- 
while, strontium isotope levels in teeth — which 
are set during childhood — suggested that most 
females werent local. In one case, two related 
individuals who lived within a few generations 
of each other were found buried with different 
material cultures. In other words, some cultural 
shifts in the archaeological record could be due 
not to massive migrations, but to the systematic 
mobility of individual women. 

It is the prospect of more such studies that 
has archaeologists salivating over ancient DNA. 
In the near future, says Stockhammer, archae- 
ologists will be able to sequence the genomes 
of all the individuals at a burial site and build a 
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local family tree, while also determining how 
individuals fit into larger ancestry patterns. This 
should allow researchers to ask how biological 
kinship relates to the inheritance of material 
culture or status. “These are the big questions 
of history. They can be solved now only with 
collaboration,’ says Stockhammer. 

Another glimpse of this approach appeared in 
February on the bioRxiv preprint server’*. The 
paper explores Europe's migration period, when 
‘barbarian hordes filled the void left after the fall 
of the Roman Empire. In the paper, a team of 
geneticists, archaeologists and historians built 
family trees of 63 individuals from two medieval 
cemeteries in Hungary and northern Italy asso- 
ciated with a group known as the Longobards. 
They found evidence of high-status outsiders 
buried in the cemetery: most bore central and 
northern European genetic ancestry that dif- 
fered from that of local people, who tended to 
be buried without goods — offering tentative 
support to the idea that some barbarian groups 
included outsiders. 

Patrick Geary, a medieval historian at the 
Institute for Advanced Study in Princeton, 
New Jersey, who co-led the Longobard study, 
would not comment on the research because 
it is now being peer reviewed. But he says that 
genetic studies of historical times, such as the 
migration period, carry pitfalls, too. Histori- 
ans are increasingly incorporating data such 
as palaeoclimate records into their work, and 
will do likewise with ancient DNA, Geary says. 
But they share archaeologists’ fears that biology 
and culture will be conflated, and that problem- 
atic designations such as Franks or Goths or 
Vikings will be reified by genetic profiles, over- 
riding insights into how ancient peoples viewed. 
themselves. “These days, what historians want 
to know about is identity,’ he says. “Genetics 
cannot answer these questions.’ 

Reich concedes that his field hasr’t always 
handled the past with the nuance or accuracy 
that archaeologists and historians would like. 
But he hopes they will eventually be swayed by 
the insights his field can bring. “We're barbar- 
ians coming late to the study of the human past,” 
Reich says. “But it’s dangerous to ignore barbar- 
ians.” m SEE EDITORIALP.559 


Ewen Callaway writes for Nature from 
London. 
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Cyberspy Shannen Rossmiller posed as male Iraqi and Afghan militants in extremist chat rooms to expose weapon caches and bomb plots. 


Cybersecurity needs women 


Safeguarding our lives online requires skills and experiences that lie beyond 
masculine stereotypes of the hacker and soldier, says Winifred R. Poster. 


( } omputer hacking is becoming more 
widespread and damaging. Headlines 
highlight attacks on government 

agencies, political campaign offices, finan- 

cial institutions and big corporations. But 
citizens and consumers are paying a heavy 
price. In 2016, 2 billion people had their per- 
sonal details stolen, including the medical 
records of more than 100 million Americans. 

Hacks of US retail outlets such as Target and 

global credit companies such as Equifax 

compromised the private data of hundreds 
of millions of customers. In the past 6 years, 
more than US$107 billion was stolen from 

US consumers through identity theft. 
Cybercrime exacerbates inequalities. A 


million more US women than men had their 
identities stolen in 2014. People of African 
American and Latino descent are, on aver- 
age, two to three times more likely than 
white people to be victims of fraud related 
to debt or income. And women and girls 
are more likely than men to be targets of 
‘remote sexual abuse’ — coerced into pos- 
ing nude online or being stalked through 
the Internet. 

Security technologies also disadvantage 
women and other groups. For example, bio- 
metric facial recognition systems have trouble 
identifying the faces of women and people of 
colour. Airport security systems and opera- 
tors disproportionately flag black women for 


strip searches relative to other passengers’. 
Cybersecurity professionals — who 
protect databases, software systems and 
computer networks from access, change 
or destruction — are predominantly male. 
Women comprise only 11% of these profes- 
sionals worldwide, and only 14% in North 
America (see ‘Women in cybersecurity’). By 
comparison, women make up 57% of the US 
workforce. Even cybersecurity’s sister indus- 
tries do better: 15% of the US military and 
25% of staff in information technology are 
women (see ‘Sister fields’). By 2020, 2 million 
more cybersecurity jobs will be needed 
worldwide in addition to the 3.2 million 
people who are already employed in the 
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> field, of whom almost 750,000 are in the 
United States. 

Cybersecurity’s future depends on its 
ability to attract, retain and promote women, 
who represent a highly skilled and under- 
tapped resource. The discipline also needs 
to learn about women’s experiences as 
victims of cybercrime and the steps needed 
to address the imbalance of harm. 

Here I highlight four ways in which the 
field should adapt. 


FOUR PRIORITIES 

Acknowledge women’s contributions. 
Women have been working in cybersecurity 
for a century. Yet many of their stories have 
been sidelined because of the secrecy of the 
work, its wartime contexts or because male 
colleagues have been put in the limelight 
instead. 

During the Second World War, the United 
States employed 10,000 women as ‘code 
girls to decipher encrypted messages sent 
by the Japanese and Germans. Likewise, 
the United Kingdom hired more than 7,000 
women to work at Bletchley Park, its centre 
for cryptanalysis, where they made up about 
three-quarters of the workforce. And in 
1942, actor and inventor Hedy Lamarr pat- 
ented an encryption method for signalling to 
torpedoes that is now the basis of WiFi and 
bluetooth technology. 

Elizebeth Smith Friedman helped to 
invent the science of cryptography for the 
US Federal Bureau of Investigation (FBI)’. 
Her techniques in the 1940s broke interna- 
tional spy rings, decoded three Nazi Enigma 
machines and contributed to the early work 
of the forerunner to the Central Intelligence 
Agency. Yet after the war, her elite code- 
breaking unit was shut down and various 
men took credit for her work, including 
her husband William Friedman and FBI 


WOMEN IN CYBERSECURITY 


director J. Edgar Hoover. Indeed, Hoover 
showed outright hostility towards women 
in this field — when he started as director 
of the FBI’s predecessor in 1924, he fired all 
the female agents and banned further recruit- 
ment of women in these roles. 

Women were the first programmers, 
calculating weapons trajectories by hand and 
entering them into the Electronic Numerical 
Integrator And Computer (ENIAC) at the 
University of Penn- 


sylvania, Philadel- “Women have 
phia, in the 1940s°. held influential 
In fact, ‘computer’ positions in 
originally referred US national 


nottothe machine cybersecurity.” 
but to the women 

who programmed it. Other women devel- 
oped the first programming languages, 
methods to detect intrusions into computer 
systems and network bridges between com- 
munication centres*. In the 1950s, African 
American female mathematicians at NASA 
calculated the aeronautical trajectories that 
put men on the Moon’. The proportion of 
women in computer science grew until the 
mid-1980s — the dawn of personal comput- 
ing — when numbers dropped precipitously. 
Today, only around 18% of US computer- 
science majors are women, compared with 
around 37% in 1984. 

Even so, in the past decade, women have 
held influential positions in US national 
cybersecurity. Theresa Payton was the 
first female chief information officer in 
the White House, under former president 
George W. Bush. Former president Barack 
Obama appointed Melissa Hathaway in 
2009 as his first ‘cyber czar’ in the role of 
acting senior director for cyberspace for the 
National Security Council. Letitia Long was 
the first woman to head a major US intel- 
ligence agency, as director of the National 


Hostile work cultures, stereotypes and discrimination are limiting the numbers* of female 
cybersecurity professionals (left). Just over half of women in cybersecurity say that they have 
experienced gender discrimination (right), compared with 15% of men. 
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Geospatial-Intelligence Agency from 2010 to 
2014. This supplied the satellite, geographi- 
cal and social-media data that enabled Osama 
bin Laden's capture. Janet Napolitano headed 
the Department of Homeland Security (DHS) 
from 2009 to 2013, and Kirstjen Nielsen has 
been at the helm since 2017. Jeannette Man- 
fra, the chief cybersecurity official for the 
DHS, is leading the investigation into Russian 
hacking of US voter registration rolls before 
the 2016 presidential election. 

Research leadership is crucial because 
it dictates the direction of security tech- 
nologies and strategies. From 2009 to 2012, 
Regina Dugan was the first female director 
of the Defense Advanced Research Projects 
Agency (DARPA), the research wing of the 
military that helped to develop the Internet 
and the Global Positioning System. Her 
priorities included machine intelligence, 
flexible and manoeuvrable shape-shifting 
devices, and soldier telepathy using brain- 
wave mapping for communicating without 
speech. Dugan has since run technology 
programmes at Google and Facebook. 
From 2008 to 2012, Lisa Porter was founding 
director of ‘DARPA for spies’ — the Intelli- 
gence Advanced Research Projects Activity. 
She led investigations into quantum comput- 
ing, biometric fingerprint identification and 
cloaking devices. 

Although there is a predominance of white 
women in these positions, women of colour 
are making inroads. Quiessence Phillips, an 
African American, has been selected as dep- 
uty chief information security officer of cyber 
command for the City of New York. And 
Indian computer scientist Aanchal Gupta is 
director of security at Facebook. 

Other women are taking more under- 
ground roles — as cyberspies’. Former judge 
Shannen Rossmiller gathered global online 
intelligence for the FBI after the terrorist 
attacks of 11 September 2001. By posing as 
male militants from Iraq and Afghanistan in 
extremist chat rooms, she exposed weapon 
caches, bomb plots and terror cells in more 
than 200 operations. Indeed, 35% of people 
who work in the intelligence field are female. 

Meanwhile, cybersleuths like Kimberly 
Ritter are fighting crimes such as online 
sex trafficking. Ritter’s work inspired com- 
puter scientists to develop an app called 
Traffickcam that enables the public to upload 
photographs of hotel rooms to a database. By 
matching these to images in advertisements 
for escorts, law enforcement agencies can 
improve the way they track down victims 
and their traffickers. 

But breaking the glass ceiling is not 
enough. Institutional barriers also need to 
be overcome. 


Recognize diverse expertise. Despite 
being few in number, female candidates for 
cybersecurity jobs tend to be more educated 
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Pennsylvania, Philadelphia, in the 1940s. 


than their male counterparts. Women 
professionals are more likely to have a 
master’s degree or higher (51% for women 
globally, compared with 45% of men). 
Women also tend to bring wider expertise. 
Although both female and male employees 
train extensively in computer science, infor- 
mation and engineering, women's degrees are 
more likely to come from fields such as busi- 
ness, mathematics and social science (44% 
for women, compared with 30% for men). 
This is crucial because cybersecurity jobs 
demand diverse skills. Professionals must 
understand network security, risk mitigation 
and information protection, and be prepared 
for future activities in artificial intelligence, 
machine learning and virtual-reality map- 
ping. They need to manage projects, navigate 
legal and compliance codes, and work in sec- 
tors from health care to law enforcement’. 
But training beyond fields in science, 
technology, engineering and maths (STEM) 
is not generally rewarded. Most cybersecu- 
rity employers view computer science or 
engineering backgrounds as a priority. Yet 
even some big tech companies are changing 
their outlook. In 2011, Microsoft co-founder 
Bill Gates disparaged liberal-arts education. 
Now, the company’s president Brad Smith 
argues that such training is crucial for the 
future of computing, especially artificial 
intelligence. Google executives were shocked 
after they analysed their workforce data and 
found that STEM expertise was the least 
important factor associated with employ- 
ees’ success. Instead, qualities such as being 


a good coach, problem-solver or critical 
thinker ranked higher. 


Shed sexist images. The two fields that are 
most closely associated with cybersecurity 
— IT and the military — are plagued by 
cultures that are hostile towards women. 

The IT industry reveres the hacker per- 
sona — a loner, typically male and white, 
working all night and forgoing other spheres 
of life. Some employees might identify with 
this image. Nonetheless, employers often use 
it as a standard for hiring and promoting, 
even though the workforce has diversified. As 
ethnomathematician Ron Eglash observed 
in 2002, these racial and gender premises of 
the hacker stereotype are being challenged by 
emerging groups such as “Black geeks, Asian 
American hipsters, and geek grrrls””. 

Sexist attitudes are rife in California's 
Silicon Valley, exemplified by the memo 
leaked in 2017 from a male employee of 
Google arguing that women are biologi- 
cally ill-suited to the technology field. The 
ride-hailing firm Uber received more than 
200 claims of sexual harassment, discrimi- 
nation and inappropriate behaviours from 
employees, according to an investigation at its 
San Francisco headquarters in 2017. The US 
federal government has filed lawsuits against 
tech firms for gendered wage discrimination. 

The soldier image is rooted in military 
legacies. Women in cybersecurity often 
report working with an ‘old boys club’ of for- 
mer intelligence and military officers®. Job 
postings call for “ninjas” and “cyberwarriors”. 


The language of cybersecurity reflects this 
ethos of defending networks against threats 
from intruders. By contrast, the concept of 
information security — centred on creating 
safe, effective systems and protecting humans 
who use them — describes the job better and 
is more widely appealing to diverse practi- 
tioners, including women. 

Cybersecurity conferences are notoriously 
male-dominated in number and hyper- 
masculine in behaviour. A female attendee 
can be the sole woman ina group of 100 men. 
Two of the largest cybersecurity confer- 
ences, DEF CON and Black Hat, are held in 
Las Vegas, Nevada — a city long associated 
with the objectification of women. Women 
at these meetings have described experiences 
to scholars and online peer networks of being 
groped and asked by male attendees if they 
are a secretary or a prostitute. 

The impacts are clear. More than half of 
women in cybersecurity around the world 
(51%) say that they have experienced gen- 
der discrimination, compared with 15% of 
men. This is sometimes overt but more often 
subtle — through tokenism, unconscious 
bias, the highlighting of mistakes, or denials 
and delays in career advancement. Female 
cybersecurity employees report a pay gap 
of 3-6% compared with male employees. 
This is much smaller than the 28% wage gap 
in the computing industry, but such losses 
accumulate over the course ofa career. In the 
past couple of years, the gap has widened for 
female managers. 

Women in technology leave their jobs 
at a higher rate than their male colleagues. 
The costs for US firms of losing and rehiring 
professionals who quit because of gender, 
racial, sexuality or religious bias are around 
$64 billion annually. 


Realize that women and girls are prime 
targets of cybercrime. Women in the 
United States were 26% more likely than 
men to experience identity theft in 2008, 
often involving the fraudulent use of a bank 
account or credit card. Two-thirds of victims 
lost money. On average, women also took 
longer than men to notice the breach: 83 ver- 
sus 45 days. This is partly because men are 
more likely to bank and shop online, and so 
receive automatic notifications within hours 
— much faster than the weeks it can take to 
spot unauthorized transactions on monthly 
financial statements. Women in the United 
States also live in poverty to a greater degree 
than men do, and are less able to invest in 
online security services, freezing credit 
scores or hiring lawyers. 

Some cybercrimes, such as ‘sextortion; are 
directed at women and girls. Criminals trick 
their targets, using e-mails infected with 
software viruses, for instance, to gain access 
to their computers. Perpetrators can search 
for photos on hard drives or use webcams to 
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watch women and girls. They then blackmail 
targets by threatening to post images and 
videos on child pornography websites, to 
compel further sexual activity. The extortion 
can extend for years, because images can be 
permanently installed on multiple platforms. 
One perpetrator might have hundreds of 
targets. For example, 78 criminal cases of 
sextortion filed since 2016 may involve as 
many as 6,500 victims across 52 jurisdic- 
tions (US and international), 29 US states or 
territories and 3 foreign countries. 

Strangers are usually assumed to be the 
culprits. In reality, women’s spouses, boy- 
friends or family members often initiate 
security breaches. Abusers glean passwords 
through key-logging devices or by coercing 
their targets to hand over passwords. Mis- 
use of technology online is linked to physical 
abuse offline. In a national survey, 97% of US 
domestic abuse shelters reported that their 
female clients had been harassed through 
technology. 


NEXT STEPS 

Cybersecurity needs to become more 
accepting of and welcoming to women. 
Schools and universities should emphasize 
the field’s creativity and real-world applica- 
tions. Programmes that promote technol- 
ogy for young women and girls should be 
encouraged, such as Girls Who Code, which 
aims to increase women in computer science 
and engineering. In another initiative, the 
US Girl Scouts organization is rolling out 
cybersecurity as one of its skill badges. 

Media outlets and conference organizers 
should invite female experts and speak- 
ers. There are more than enough to choose 
from: the Grace Hopper Celebration (named 
after one of the first women coders) meets 
annually and brought together around 
18,000 women technologists last year. 
WiCYyS and the Diana Initiative are smaller 
but more specialized networks that also pro- 
mote women in cybersecurity. 

Employers should craft job advertise- 
ments that use inclusive wording, and seek 
candidates beyond computer science and 
the military, such as from the social sciences, 
humanities, law and public policy. Recruit- 
ers should use diverse selection panels 


SISTER FIELDS 
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than does cybersecurity. 
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NASA mathematician Katherine Johnson did 
the calculations that put men on the Moon. 


and gender-blinded screening of résumés. 
Women should be hired in cohorts to avoid 
isolation and experiences of tokenism. 
Retaining women requires fair career 
trajectories. Employers should track and 
enforce gender parity in evaluations, promo- 
tions and salaries. Some firms are showing 
progress. The Anita Borg Institute’s list of 


Top Companies for 
“Retaining Women Technolo- 
womenrequires _ gists highlighted 
fair career the firms Accen- 


ture, Geico and 
ThoughtWorks as 
its winning examples in 2017. 

Researchers need to answer pressing 
questions about gender in cybersecurity. 
For example, how can retail and credit firms 
better protect women from the fallout of 
hacks and identity theft? How can domes- 
tic partners be prevented from accessing 
passwords? Regional disparities also need 
examining. For instance, Europe has fewer 
women in cybersecurity (7%) than North 
America does (14%), despite similar levels of 
industrialization. India has a better record of 
graduating female computer programmers 
(30%) than does the United States (21%)’. 

Cybersecurity professionals and 
organizations need to build partnerships 
to understand the types of people who are 
affected by cybercrime. In the United States, 
organizations might include the National 
Network to End Domestic Violence, the 
National Association for the Advancement 
of Colored People, UnidosUS (a non-profit 
Latino advocacy organization), the Federal 
Trade Commission and the Consumer 
Financial Protection Agency. 

The purpose and practice of cybersecurity 
must be questioned. Huge amounts of money, 


trajectories.” 


technology and resources are often allocated 
to command-and-control strategies that 
can be inefficient and counterproductive”’. 
Common-sense solutions, such as enforcing 
implementation of security patches and 
strengthening privacy protections on con- 
sumer electronics, can be simpler, cheaper 
and more effective. 

Security systems must protect everyone, 
equally. Celebrating, attracting, training 
and retaining a greater diversity of scholars 
and technicians in information-security 
research, business and governance are steps 
in that direction. = 


Winifred R. Poster is a lecturer in 
international affairs at Washington 
University, St. Louis, Missouri, USA. 
e-mail: wposter@wustl.edu 
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CORRECTIONS 

In data used for the graph ‘Open countries 
have impact’ in the Comment ‘Open 
countries have strong science’ (Nature 
550, 32-33; 2017), some articles 
classified into more than one field were 
counted twice. The online version of 

the graph has been updated to show 
disaggregated counts. The categorization 
of countries has not changed. 

In the Comment ‘How to make 
replication the norm’ (Nature 554, 
417-419; 2018) mistakenly stated 
that only authors of non-replicated 
articles from the 3ie project described 
antagonism. In fact, authors of both 
replicated and non-replicated studies did. 
Also, the number of replications was 21, 
not 20. The data set is now available as 
supplementary information. 

The Comment ‘Smartphones are bad 
for some teens, not all’ (Nature 554, 
432-434; 2018) misrepresented the 
results from the study on depressive 
episodes. The initial year should have 
been 2005, not 2004. And the rise 
between 2005 and 2014 should have 
been in percentage points, not per cent. 
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EUGENE RICHARDS 


Physicist Freeman Dyson, pictured in 2009. 


A life of scientific 


delight 


Ann Finkbeiner is charmed by the originality and 
acuity in Freeman Dyson’s letters. 


inclined physicist who proved in 1949 

that competing theories of quantum 
behaviour were equivalent. In a career 
spanning seven decades, he branched out 
into myriad fields. They included con- 
densed-matter physics, nuclear reactors, 
astronomical technology, extraterrestrial 
habitation and advising the US government 
on national security, sometimes as part of the 


hee Dyson is a mathematically 


elite post-war research group JASON. 
Dyson grew up in Winchester, UK, and 
leftin 1941 for the University of Cambridge, 
where he studied mathematics. There, he 
began a lifelong habit of writing regular 
letters home. Now, aged 94 and living in 
the United States, he has published some 
of them in Maker of Patterns. They cover 
a remarkable range of scientific interests, 
acquaintances, opinions and adventures. 


BOOKS & ARTS 


—— = The patterns Dyson 
un says he made were 

first those of ideas 

in mathematics and 
physics, and then 
| those in his writing 
about literature and 
history. Readers might 
hope that Dyson’s own 
pattern — the refer- 
ence frame in which 
his remarkable range 
becomes a coherent 
whole — would be 
found in his letters. 
As a writer who has interviewed Dyson, I 
would advise against such hope. 

The letters read like a travel journal written 
for people he loves and trusts. “I think the 
reason I write so openly is just this,” he tells 
his parents in 1949, “that all these adventures 
in this strange new world are still somewhat 
unreal to me, and in writing to you about 
them, I bring them in contact with my famil- 
iar world and lend them some of your reality.” 

Dyson notices everything. He describes 
Americans friendliness as a result of their 
inattention to the past, and thus of their lone- 
liness in time. He watches astronomers react 
to an unexpected discovery: how they were 
impressed into silence and how the theorists 
mumbled but could say nothing coherent. He 
quotes and describes children, his own and 
other people's: “the children have come out 
of doors in summer clothes... [The] grass is 
green and the crocuses are out. The children 
are more brightly coloured than the crocuses.” 

He describes his “extraordinary collection 
of friends”. At Cambridge's Trinity College, 
he writes, he could always find its splendid 
mathematicians — including G. H. Hardy — 
at the local billiard table. At Cornell Uni- 
versity in Ithaca, New York, he worked with 
physicists Hans Bethe (“he says he would have 
been willing to use atomic bombs to defeat 
Germany, so convinced was he of the evil 
character of the government”) and Richard 
Feynman (“sizzling with new ideas, most of 
which are more spectacular than helpful”). At 
the Institute for Advanced Study in Princeton, 
New Jersey, he hada complex friendship with 
atom-bomb pioneer J. Robert Oppenheimer 
(“he is moving around nervously all the time, 
never stops smoking, and I believe that his 
impatience is largely beyond his control”). 

Some of Dyson's most vivid descriptions, 
however, are about encounters and individ- 
uals outside science. A theology student at 
Cambridge “has a deep and resonant bass 
voice and talks in a slow and rhythmical 
way, with a solemn ring on the open vowels 
which is enough to send shivers down one’s 
spine”. A disabled boat-builder and his girl- 
friend “walk slowly down the beach ... like 
the last act of King Lear, when the crazy old 
king and his faithful daughter Cordelia 


Maker of Patterns: 
An Autobiography 
Through Letters 
FREEMAN DYSON 
Liveright: 2018. 


29 MARCH 2018 | VOL 555 | NATURE | 581 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 
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are led away to their doom”. 

He says what you wouldn't expect; if 
Dyson has a pattern, perhaps it is contra- 
riety. He prefers the students at Haver- 
ford College, a liberal-arts university in 
Pennsylvania — who are “ostentatiously 
ill-dressed, uncombed, and unwashed” — 
to Princeton's “pampered” ones. He con- 
tradicts himself. His wartime job at the 
UK Royal Air Force’s Bomber Command, 
analysing the effect of Allied firebomb- 
ing on German 
cities, left him 
with a “perma- 
nently bad con- 
science”. Yet he 
understood the 
joy of German 
sailors who torpedoed Allied fuel tankers 
because he felt “elation” when the fire- 
bombing succeeded. 

In the late 1950s, he worked with the 
defence contractor General Atomic 
in La Jolla, California, on a project he 
thought would be “legendary”: a space- 
ship called Orion, ill-advisedly powered 
by nuclear explosions. Because building 
it required nuclear testing, he publicly 
opposed the Nuclear Test Ban Treaty; 
after Orion was defunded, he supported 
the treaty. Later, he became president of 
the anti-nuclear-proliferation Federation 
of American Scientists. He proposed the 
‘no first use policy on nuclear weapons, 
while admiring his theoretical-physicist 
friend Edward Teller for standing against 
the treaty. Dyson's contradictions might 
seem confusing, but I can hear him 
politely and methodically explaining the 
rationality of seeing reality from different 
reference frames, of holding mutually 
opposing views and of being free to 
change one’s mind. 

The one Dysonian pattern for which 
the letters hold unequivocal evidence 
is delight. He uses the word often and 
invokes it even more: “Today I discov- 
ered a little theorem which gave me some 
intense moments of pleasure. It is beauti- 
ful and fell into my hand like a jewel from 
the sky.” In the book's last letter, he writes 
to his sister about being robbed and hit 
on the head. Preparing to die, he notices 
the sunlight on the bushes and thinks, “life 
has been good to me and this death is also 
good, with the bright sun and the green 
bushes”. 

Maybe with some people, you don't 
look for patterns. You just enjoy their 
multivariate company. = 


Ann Finkbeiner is a freelance science 
writer in Baltimore, Maryland, and 
author of The Jasons. She blogs at www. 
lastwordonnothing.com. 

e-mail: anniekf@gmail.com 
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Niels Bohr (left) with Albert Einstein in the late 1920s, when quantum mechanics was in its infancy. 


Quantum-theory wars 


Ramin Skibba explores a history of unresolved 
questions beyond the Copenhagen interpretation. 


Il hell broke loose in physics some 
A” years ago. Quantum theory 
emerged — partly in heated clashes 
between Albert Einstein and Niels Bohr. 
It posed a challenge to the very nature of 
science, and arguably continues to do so, by 
severely straining the relationship between 
theory and the nature of reality. Adam 
Becker, a science writer and astrophysicist, 
explores this tangled tale in What Is Real?. 
Becker questions the hegemony of the 
Copenhagen interpretation of quantum 
mechanics. Propounded by Bohr and Werner 
Heisenberg in the 1920s, this theory holds 
that physical systems have only probabilities, 
rather than specific properties, until they’re 
measured. Becker argues that trying to parse 
how this interpretation reflects the world 
we live in is an exercise in opacity. Showing 
that the evolution of science is affected by 
historical events — including sociological, 


What Is Real?: The 
Unfinished Quest 
for the Meaning of 
Quantum Physics 
ADAM BECKER 


Basic: 2018. 


cultural, political and 
economic factors — 
he explores alterna- 
tive explanations. 
Had events played 
out differently in the 
1920s, he asserts, our 
view of physics might 
be very different. 
Becker lingers 
on the 1927 Solvay 
Conference in 
Brussels, where 29 
brilliant scientists 
gathered to discuss 
the fledgling quantum 


theory. Here, the disagreements between 
Bohr, Einstein and others, including Erwin 
Schrédinger and Louis de Broglie, came to a 
head. Whereas Bohr proposed that entities 
(such as electrons) had only probabilities if 
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they weren't observed, Einstein argued that 
they had independent reality, prompting his 
famous claim that “God does not play dice”. 
Years later, he added a gloss: “What we call 
science has the sole purpose of determining 
what is.” Suddenly, scientific realism — 
the idea that confirmed scientific theories 
roughly reflect reality — was at stake. 

Quantum phenomena were phenomenally 
baffling to many. First was wave-particle 
duality, in which light can act as particles and 
particles such as electrons interfere like light 
waves. According to Bohr, a system behaves as 
a wave or a particle depending on context, but 
you cannot predict which it will do. 

Second, Heisenberg showed that 
uncertainty, for instance about a particle’s 
position and momentum, is hard-wired into 
physics. Third, Bohr argued that we could 
have only probabilistic knowledge of a system: 
in Schrédinger’s thought experiment, a cat in 
a box is both dead and alive until it is seen. 
Fourth, particles can become entangled. For 
example, two particles might have opposite 
spins, no matter how far apart they are: if you 
measure one to be spin up, you instantly know 
that the other is spin down. (Einstein called 
this “spooky action at a distance”) 

Becker explains how these observations 
challenge locality, causality and determinism. 
In the classical world of billiard balls, projec- 
tiles and apples falling from trees, they were 
never problems. 

Sifting through the history, Becker shows 
how Bohr, as an anti-realist, brought to 
his side many rising physicists, including 
Heisenberg, Wolfgang Pauli and Max Born. 
Einstein, however, persistently argued that 
the Copenhagen interpretation was incom- 
plete. He conjectured that there might be 
hidden variables or processes underlying 
quantum phenomena; or perhaps ‘pilot 
waves, proposed by de Broglie, govern the 
behaviour of particles. In 1932, mathemati- 
cian John von Neumann produced a proof 
that there could be no hidden variables in 
quantum mechanics. Although mathemati- 
cally correct, it was revealed to be flawed dec- 
ades later. But the damage had been done: the 
potentially viable alternatives conceived by 
Einstein and de Broglie remained relatively 
unexplored. The Copenhagen interpretation 
had taken hold by the 1930s, and textbooks 
today state that Bohr’s view ‘won: 

Thus, the Solvay Conference can be seen 
as a stand-off between two mathematically 
equivalent but fundamentally different 
paradigms: Bohr’s instrumentalist view of 
quantum physics and Einstein’s realist one. 
Inscience, a dominant paradigm determines 
which experiments are done, how they’re 
interpreted and what kind of path a research 
programme follows. 

But what if a field picks the wrong para- 
digm? Becker shows how, in the 1950s and 
1960s, a handful of physicists dusted offthe > 
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Work: The Last 1,000 Years 

Andrea Komlosy, trans. Jacob K. Watson & Loren Balhorn VERSO (2018) 
Employment outside the home became the globally dominant form 
of work only after 1900. Yet in policy and discourse, it has eclipsed 
older, more informal modes of production. Social historian Andrea 
Komlosy probes the joins between them in this sweeping chronicle, 
from 1250 — the dawn of globalization and urbanization — to today, 
when international corporations threaten labour standards. This is a 
book teeming with insights, from the contempt for manual labour in 
ancient Greece to the historical tendency for all kinds of subsistence 
tasks to be “housewife-ized” into unpaid domestic labour. 


The Ascent of John Tyndall 

Roland Jackson OXFORD UNIVERSITY PRESS (2018) 

A leading light in climate science, germ theory and magnetism, 
Victorian researcher John Tyndall seemed a man for all sciences. He 
discovered the greenhouse effect and why the sky is blue, and as a 
public intellectual, hobnobbed with physicists Michael Faraday and 
Hermann von Helmholtz, and poet Alfred Tennyson. This splendid 
monument of a biography by Roland Jackson tracks Tyndall’s rise 
from rural Ireland to laboratorial glory days in Britain and Europe. The 
experimentalist’s obscurity now, Jackson avers, may be down in part to 
his death in 1893: he just missed the era of relativity and the quantum. 


——————— Microbia 
Eugenia Bone RODALE (2018) 
At the age of 55, journalist Eugenia Bone returned to university to take 
a microbiology course, reasoning that “humility is the entry point for 
studying nature”. The microbial matrix of life came into focus. Bone’s 
is an exquisitely observant, often amusing voyage from the origins 
of life to antibiotic resistance, punctuated by scientific revelations. 
} (The compound geosmin, produced by Streptomyces bacteria, gives 
soil its sweet smell; bacteria nucleate rain; a quadrillion microbes 
pack part of a cow’s stomach.) The devil’s in the detail, Bone finds, but 
what enchants is the connectivity of this ‘hidden’ ocean. 


Elastic: Flexible Thinking in a Constantly Changing World 

Leonard Mlodinow PANTHEON (2018) 

Physicist Leonard Mlodinow revels in the neuroscientific. Here he 
extends his explorations with an in-depth study of agile creative 
thinking in a hectic age. Among the psychological factors Mlodinow 
isolates are a yen for novelty, and the capacity to reconcile diverse 
ideas. He plunges deep into the human brain, marshalling 
compelling research both on brain basics and on outlier issues such 
as mental blocks. Perhaps most gripping is his take on the “default 
network” — the brain structures that govern richly creative activities 
such as daydreaming, the “dialogue we have with ourselves”. 


Wild Moms 

Carin Bondar PEGASUS (2018) 

Biologist and broadcaster Carin Bondar’s tour of the vagaries of 
motherhood in the animal kingdom is a thrills-and-spills survey, 
from brood parasitism to cooperative breeding. The diversity of 
evolutionary solutions to maternality is mind-bending. We learn, 
for instance, how baby koalas ingest their mothers’ faeces to 
inoculate their gut microbiomes, and that female gastric-brooding 
frogs vomited up their froglets. Ultimately, however, this is less a 
synthesized narrative than a biological litany. Barbara Kiser 
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theories of Einstein and de Broglie and 
turned them into a fully fledged interpreta- 
tion capable of shaking up the status quo. 
David Bohm argued that particles in quan- 
tum systems existed whether observed 
or not, and that they have predictable 
positions and 
motions deter- 
mined by pilot 
waves. John Bell 
then showed 
that Einstein’s 
concerns about 
locality and 
incompleteness 
in the Copen- 
hagen interpretation were valid. It was 
he who refuted von Neumann's proof by 
revealing that it ruled out only a narrow 
class of hidden-variables theories. 

The scientific community greeted 
Bohm’s ideas coolly. A former mentor, 
J. Robert Oppenheimer, said: “if we can- 
not disprove Bohm, then we must agree 
to ignore him”. And, as Becker shows, 
Bohm leftist views led to an appearance 
before the House Un-American Activities 
Committee, and subsequent ostracization. 

Bohm’s contemporary, physicist Hugh 
Everett, delivered another challenge to 
the Copenhagen interpretation. In 1957, 
Everett set out to resolve the ‘measurement 
problem in quantum theory — the con- 
tradiction between the probabilistic nature 
of particles at the quantum level and their 
‘collapse; when measured, into one state at 
the macroscopic level. 

Everett's many-worlds interpretation 
posited no collapse. Instead, probabilities 
bifurcate at the moment of measurement 
into parallel universes — such as one 
in which Schrédinger’s cat is alive and 
another in which it’s dead. Although an 
infinite number of untestable universes 
seems unscientific to some, many physi- 
cists today view the theory as important. 

The bookhas a few minor shortcomings. 
Becker gives too much space to recent 
applications building on Bell’s research, 
and too little to new developments in the 
philosophy of science. Yet he, like cosmolo- 
gist Sean Carroll in his 2016 The Big Picture 
(R. P. Crease Nature 533, 34; 2016), does 
make an explicit case for the importance of 
philosophy. That's a key call, with influen- 
tial scientists such as Neil deGrasse Tyson 
dismissing the discipline as a waste of time. 

What Is Real? is an argument for 
keeping an open mind. Becker reminds 
us that we need humility as we investigate 
the myriad interpretations and narratives 
that explain the same data. m 


Ramin Skibba is an astrophysicist turned 
science writer based in San Diego, California. 
e-mail: raminskibba@gmail.com 
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A Pan Am shuttle prepares to dock at the international Space Station V in this classic scene from the film. 


2001: A Space Odyssey 


Fifty years on, the masterful science-fiction film looks 
more prophetic than ever, reflects Piers Bizony. 


and his screenwriting colleague, 

science-fiction author Arthur C. Clarke, 
presented 2001: A Space Odyssey. Half a 
century later, this unprecedentedly detailed 
speculation about our place in the cosmos 
and our evolving relationship with 
technology is regarded as one of the great 
landmarks in cinema. 

The influence of 2001 on the design 
of subsequent space-film hardware and 
special effects has been pervasive. How- 
ever, in terms of artistic and philosophical 
bravura, it has been a harder act to follow. 
In 2007, director Ridley Scott (of Blade 
Runner and Alien fame) told a Venice Film 


E 1968, film-maker Stanley Kubrick 


Festival audience: 
“After 2001, science 
fiction is dead.” 

The narrative was 
ambitious, to say 
the least. Prehistoric 
apemen have a mind- 
altering encounter with an alien black 
monolith. Four million years later, a giant 
spacecraft is sent to Jupiter on a mysteri- 
ous mission. On board are two astronauts, 
three hibernating scientists and a seemingly 
sentient computer, HAL 9000. Hovering 
above Jupiter, another monolith waits. 

Monoliths aside, 2001 was prescient 
in almost all its detailed predictions of 


Metro-Goldwyn-Mayer: 
1968. 
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twenty-first-century technology. For 
instance, in August 2011, the Samsung 
electronics group began a defence against 
a claim of patent infringement by Apple. 
Who invented the tablet computer? 
Apple claimed unique status for its iPad; 
Samsung presented a frame from 2001. 

Samsung noted that the design claimed 
by Apple had many features in common 
with that of the tablet shown in the film 
clip — most notably, a rectangular shape 
with a display screen, narrow borders, a 
flat front and a thin form. In an era when 
computers still needed large rooms to 
accommodate them, Kubrick’s special- 
effects team rigged hidden projectors to 
enliven devices that looked as though you 
could hold them in one hand. Only the need 
to trim the film’s running length prevented 
ingenious mock-ups of touch-sensitive 
gaming screens and electronic newspapers 
from making the final cut. 

Indeed, 2001 got much right about the 
twenty-first century, including the psycho- 
logical costs of our reliance on technology. 
Kubrick’s astronauts look listless inside 
vehicles perfectly capable of function- 
ing without them. Dialogue in the film is 
deliberately banal. HAL gets all the best 
lines, even as he kills off most of his human 


companions on board the Jupiter-bound 
spaceship. As our machines smarten up, 
we will dumb down, Kubrick suggested. 
The crass human discourse often found on 
today’s algorithmically clever social-media 
platforms seems to bear out his pessimism. 

Certainly, in the film, the surviving 
astronaut’s final conflict with HAL pre- 
figures a critical problem with today’s 
artificial-intelligence (AI) systems. How 
do we optimize them to deliver good out- 
comes? HAL thinks that the mission to 
Jupiter is more important than the safety 
of the spaceship’s crew. Why did no one 
program that idea out of him? Now, we 
face similar questions about the automated 
editorship of our searches and news feeds, 
and the increasing presence of AI inside 
semi-autonomous weapons. 

We also face the startling fact that 
1960s assumptions about the progress of 
AI were optimistic (computing pioneer 
Marvin Minsky served as a consultant for 
2001). No machine available today can 
match HAL’ performance. Just look at 
the YouTube spoofs of “digital personal 
assistants” reimagined as HAL: Kubrick’s 
red-eyed genius outsmarts them all. 

Where the film made technical errors, 
it did so in ways that were flawed rather 
than merely wrong. Kubrick’s machinery 
is a matter for nerdish analysis instead of 
the ridicule usually reserved for outdated 
fantasies. Rocket experts happily argue 
about the film’s docking arrangements 
and control-panel layouts; but none of the 
machinery is out of bounds in terms of 
what we could actually achieve. 

And Kubrick and Clarke confidently 
predicted space projects that still lie many 
years, or decades ahead, 17 years after 
the film’s eponymous dateline. Famously, 


A shuttle flight attendant walks in zero gravity. 
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Kubrick gave us the most persuasive space 
station in all of science fiction, a gigantic 
twin-wheeled “Orbiter Hilton”. It was 
complete with coffee-vending machines, 
garish designer chairs in the lounges, phone 
booths accepting credit cards, and pano- 
ramic windows offering spectacular views 
of the Earth (something so familiar to the 
film’s spacefarers that they ignore it). 

Space planners of the 1960s assumed 
that artificial gravity would be essential. In 
the event, the International Space Station 
was designed for science experiments that 
depend on cancelling out gravity’s influ- 
ence, not replicating it. Now the wheel 
has turned full circle, as we recognize the 
debilitating effects of long-term weight- 
lessness on the human body. Aerospace 
engineers are starting to think about how 
we could make revolving habitats for future 
deep-space missions, or orbiting hotels for 
private adventurers. 

As for such doughty explorers, 
recent triumphs of Elon Musk’s SpaceX 
rocket company validate Kubrick’s use 
of Pan Am as his space-shuttle carrier of 
choice. Although Pan Am no longer exists, 
the corporate conquest of the cosmos is 
under way, just as 2001 predicted. For 
today’s orbital entrepreneurs, the film is not 
just a beautiful piece of science fiction. Itis a 
technical manifesto — an inspirational call 
to arms, around which a real commercial 
space industry is being founded. 

Of course, 2001 also suggests some kind 
of non-terrestrial influence over human 
progress. After unsatisfactory experiments 
trying to depict aliens, Kubrick chose only 
to hint at their presence, allowing those 
black slabs to stand in for — well, for what, 
exactly? 

Today, we know that planetary systems 
orbiting other suns are the rule rather than 
the exception, and that prebiotic molecules 
suffuse the vast, interstellar clouds of dust 
and gas from which new stars and planets 
are born. Have any of these molecules 
sparked into life on other worlds, perhaps 
giving rise to intelligent entities? If so, 
might some of them be more advanced 
than we are? 

Should we watch out for superior 
“aliens” closer to home, and guard against 
AI systems one day supplanting us in the 
evolutionary story yet to unfold? Or does 
the absence of anything like HAL, even 
after 50 years, suggest that there is, after 
all, something fundamental about intelli- 
gence that is impossible to replicate inside 
a machine? Until we know the answers to 
such profound questions, 2001: A Space 
Odyssey cannot stale. m 


Piers Bizony is a science journalist, space 
historian and author. 
e-mail: piersbizony@icloud.com 
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Climate and conflict: 
risk multipliers 


To our knowledge, no one in 

the field of climate research has 
suggested that climate change 
could be the “sole cause” of war, 
violence, unrest or migration 

(see Nature 554, 275-276; 2018). 
We argue that viewing climate 
change instead as a risk multiplier, 
influencer or co-factor can help 
to inform rather than inflame this 
important discussion. 

In this way of thinking, 
environmental and ecological 
factors interact with social 
determinants, including those 
that are economic, demographic 
and political, to produce 
phenomena such as migration, 
conflict and famine (D. C. Bowles 
et al. J. R. Soc. Med. 108, 390-395; 
2015). An example you cite 
found that drought across sub- 
Saharan Africa in 1990-2011 
contributed to an increased risk 
of rioting because it affected 
the region's rain-fed agriculture 
(C. Almer et al. J. Environ. Econ. 
Manage. 86, 193-209; 2017). 
This type of agriculture was 
thus a contributory factor to the 
likelihood of rioting — a risk that 
was then magnified by drought. 

Including such environmental 
factors and multipliers will 
improve understanding of the 
causes of conflict. Without these, 
models for studying conflict 
could become an oversimplified 
form of social determinism. 
Colin D. Butler Australian 
National University, Canberra, 
Australia. 

Ben J. Kefford University of 
Canberra, Canberra, Australia. 
colin. butler 1955@gmail.com 


Climate and conflict: 
don’t oversimplify 


Scientific and methodological 
challenges make it hard to 
evaluate links between climate 
change and human conflict. We 
find the criticisms made ina 
review of these challenges (see 
C. Adams et al. Nature Clim. 
Change 8, 200-203; 2018), as 


outlined in your Editorial (Nature 
554, 275-276; 2018), to bea 
flawed oversimplification. 

One criticism is that the field 
suffers from a potential sampling 
bias because research focuses only 
on regions of violent conflict to 
draw conclusions about the social 
and political effects of climate 
change. In fact, researchers use 
such cases to identify the variety 
of factors that produced these 
conflicts, just as epidemiologists 
study disease outbreaks to 
understand the factors that 
produce epidemics. 

Researchers are also accused of 
ignoring instances of cooperation 
in favour of conflict. That is not 
why studying cooperation is 
important. Comparing triggers of 
cooperation and conflict helps to 
understand human responses to 
environmental stresses. 

We agree that research results 
should not be overgeneralized 
or inappropriately used to justify 
causality between climate and 
conflict. Both practices are rare in 
our experience, contrary to your 
implication. Most researchers 
take pains to describe climate and 
weather merely as ‘contributing 
factors’ to some conflicts (see, 
for example, C. P. Kelley et al. 
Proc. Natl Acad. Sci. USA 112, 
3241-3246; 2015). 

Peter H. Gleick Pacific Institute, 
Oakland, California, USA. 
Stephan Lewandowsky 
University of Bristol, UK. 

Colin Kelley Center for Climate 
and Security, Washington DC, USA. 
pgleick@pacinst.org 


Climate and conflict: 
no stigma 


Your Editorial on sampling bias 
in studies linking climate change 
with civil unrest (Nature 554, 
275-276; 2018) is based on an 
analysis that in our view provides 
no evidence for biased results 

(C. Adams et al. Nature Clim. 
Change 8, 200-203; 2018). 

We disagree with your 
contention that it is “undesirable” 
to study risk factors for 
populations with a high likelihood 


of conflict on the grounds 

that it could “stigmatize” these 
regions as politically unstable. 
The same logic would argue 
against studying risk factors for 
people who have a high chance 
of developing cancer for fear of 
stigmatizing patients. In our view, 
such recommendations could 
create bias in the literature by 
inhibiting research. 

Studies of connections 
between climate and conflict 
should instead be motivated 
to identify causes of human 
suffering so that it can be 
alleviated (see, for example, 

M. Burke et al. Annu. Rev. Econ. 
7, 577-617; 2015). We do not 
believe that shying away from 
investigations in this field is an 
effective path towards this goal. 
Solomon Hsiang University of 
California, Berkeley, USA. 
Marshall Burke Stanford 
University, California, USA. 
shsiang@berkeley.edu 


Details matter for 
lab kit contaminants 


We wish to clarify some points 
related to our reporting of 
facultative pathogenic bacteria in 
a commercially available genetic- 
engineering kit last year (see 
Nature 552, 291; 2017). 

Enterococcus spp., Klebsiella 
pneumoniae and Enterobacter 
spp. are among the ten microbes 
most frequently associated with 
infections in the acute-care wards 
of European hospitals. For users 
of the contaminated kits, the 
risk of infection through broken 
skin or mucosal contact was 
considered ‘low by the European 
Centre for Disease Prevention 
and Control (ECDC) — not ‘very 
low, as you wrote. These bacteria 
were multidrug-resistant and so 
needed special measures for safe 
handling. 

Regarding your implication 
that the Bavarian authorities 
withheld data on the kits, we 
informed all key people involved, 
as well as relevant institutions 
such as the World Health 
Organization and the ECDC, 


about the methods and outcome 
of our analyses. However, the 
official results were not made 
publicly available. 

We stress that the regulatory 
status of biological agents in 
Germany is unambiguous. In this 
case, the bacteria were classified 
as pathogens under the German 
Protection Against Infection 
Actand so their import, export, 
storage and handling needed 
official permission. The German 
Genetic Engineering Act allows 
genetic engineering operations 
only in approved installations. 
Nina Koehler, Armin Baiker, 
Ulrich Busch Bavarian Health 
and Food Safety Authority (LGL), 
Oberschleifheim, Germany. 
ulrich. busch@lgl.bayern.de 
Competing non-financial interests 
declared; see go.nature.com/2hznref. 


Boost children’s 
digital intelligence 


A survey last year by my 
organization of 38,000 children 
across 29 countries (see 
go.nature.com/2fgrnnp), 
revealed that more than 50% of 
8-12-year-olds were exposed to 
at least one cyber-related threat 
such as technology addiction, 
cyber-bullying or identity theft 
(see also C. Odgers Nature 554, 
432-434; 2018). One solution is 
to teach children how to develop 
digital intelligence (DQ) —a 
set of social, emotional and 
cognitive capabilities that helps 
them to minimize the risks and 
pressures of digital media and to 
maximize its opportunities. 

At Singapore’s award-winning 
DQ Institute, we work with 
governments and international 
partners such as the World 
Economic Forum and the 
Organisation for Economic 
Co-operation and Development 
to improve digital education, 
culture and innovation through 
cross-sector collaboration, global 
dialogue and big data research 
(www.dqinstitute.org). 

Yuhyun Park Nanyang 
Technological University, Singapore. 
park@dginstitute.org 
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John Sulston 


(1942-2018) 


Nobel-prizewinning champion of the Human Genome Project and open data. 


he principle that genomic data should 
| be universally shared without com- 
mercial involvement owes its wide- 
spread acceptance largely to John Sulston. 
As leader of the British contribution to the 
international Human Genome Project, 
Sulston persuaded funders and colleagues of 
the crucial importance of making a complete, 
high-quality sequence freely available to the 
global scientific community. His commit- 
ment stemmed from a moral certainty that 
profit as a motive had no place in science. 

In 2002, he shared the Nobel Prize in 
Physiology or Medicine for his contribu- 
tion to understanding how genes control the 
fate of cells in the developing roundworm 
Caenorhabditis elegans. In his work on the 
worm cell lineage and, later, genome sequenc- 
ing, Sulston promoted the idea that invest- 
ing in large-scale data collection without a 
specific hypothesis has long-term benefits. 

Sulston, who died on 6 March, was the 
son of an English clergyman and a teacher. 
He took to heart his parents’ indifference to 
material wealth and their desire to work for 
the common good. Mechanics fascinated 
him from an early age and, by dissecting 
dead animals, he began to see living things, 
too, as machines. He won a scholarship to 
study natural sciences at the University of 
Cambridge, UK. 

His PhD, also at Cambridge, was on the 
synthesis of oligonucleotides — the building 
blocks of nucleic acids such as DNA and RNA. 
His flair as an experimentalist secured him a 
position as a postdoc with organic chemist 
Leslie Orgel at the Salk Institute for Biological 
Studies in La Jolla, California, investigating 
the origins of life. Sulston arrived in 1966 to 
study the replication of nucleic acids, and, for 
the first time he understood the evolutionary 
context of his work in chemistry. 

There, too, Sulston met Francis Crick, co- 
discover of the DNA’s double helix; Crick 
recommended him for a post at the Medical 
Research Council’s Laboratory of Molecular 
Biology backin Cambridge. In 1969, Sulston 
joined a small group headed by geneticist 
Sydney Brenner. Looking for a model organ- 
ism in which to explore the interactions of 
genes, development and behaviour, Brenner 
chose C. elegans. The tiny worm has only 959 
cells as an adult, yet it has a gut, a nervous 
system, gonads and a repertoire of behav- 
iours to move, feed and reproduce. 

It is also transparent. Sulston used what 
became his favourite tool, the Nomarski 
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differential-interference contrast microscope, 
to visualize cell nuclei in living worm larvae 
and, later, in the more challenging embryos. 
He observed and recorded the invariant 
sequence of cell divisions that build an adult 
worm. His work provided a foundation for 
future worm biologists to answer questions 
about development that have implications for 
other species, including our own. 

Sulston spotted that certain cells are elimi- 
nated during development. He began to 
explore the genetics of this process with US 
postdoc, Robert Horvitz. On his return home 
to the Massachusetts Institute of Technology 
in Cambridge, Horvitz discovered genes that 
regulate programmed cell death. Later, muta- 
tions in these genes proved instrumental to 
understanding the uncontrolled multipli- 
cation of cancer cells. Sulston, Horvitz and 
Brenner shared the 2002 Nobel prize “for 
their discoveries concerning genetic regula- 
tion of organ development and programmed 
cell death”. 

From 1983, Sulston set out to map and 
sequence the 100-million-base-pair genome 
of the worm. His lab worked in partnership 
and friendly competition with that of Robert 
Waterston, then at the University of Wash- 
ington in St. Louis. In 1989, their collective 
drive motivated Jim Watson, then head of the 
Human Genome Project, to fund their pilot 
sequencing initiative. The worm project dem- 
onstrated that automated, high-throughput 


sequencing of the human genome was not 
impossible. In 1992, the UK Wellcome Trust 
invited Sulston to head its new sequencing 
facility, the Sanger Centre (now the Sanger 
Institute), in Hinxton. 

Leading the Sanger as part of an interna- 
tional consortium, Sulston had a key role in 
establishing the principles of data release and 
open access. When, in 1998, a private initia- 
tive led by Craig Venter of Celera Genomics 
announced its intent to sequence the human 
genome first, and for commercial profit, 
Sulston defended the principle of open data. 
He became, with Francis Collins — then head 
of the US National Human Genome Research 
Institute — a leading voice in persuading both 
the US National Institutes of Health and the 
Wellcome Trust to commit to the project's 
completion. The human genome sequence, 
finished to a standard of 99.99% accuracy, 
was published in Nature on 21 October 2004 
(International Human Genome Sequencing 
Consortium Nature 431, 931-945; 2004). 

As Sulston predicted, the availability of the 
complete human genome drove research, 
both academic and commercial, worldwide. 
The work is slowly delivering clinical appli- 
cations, especially in areas such as cancer, 
where genetic mutations determine whether 
a tumour will be sensitive to a therapy. Mean- 
while, technology has advanced and costs 
dropped such that whole-genome sequencing 
of individuals might soon be routine. 

Sulston was hands-on in the lab, person- 
ally preparing the worm-clone library for the 
mapping project, and decrypting the early 
sequencing machines so that the electronic 
data could be analysed directly. But he always 
valued collaborators with skills different from 
his own, and co-managed the Sanger on a 
board of seven people. When he asked me to 
co-author a book on the battle for the human 
genome, The Common Thread, he insisted 
that we work as equal partners. 

After the human sequence was published, 
Sulston devoted himself to writing and speak- 
ing in support of open access and, more 
broadly, on the relationship between science 
and society. Warm and articulate, he won over 
audiences with his humility and passion. = 


Georgina Ferry is a science writer and 
biographer in Oxford, UK. She and Sulston 
co-authored The Common Thread: A Story 
of Science, Politics, Ethics and the Human 
Genome (Bantam Press, 2002). 

e-mail: mgf@georginaferry.com 
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BRIEF COMMUNICATIONS ARISING 


Is plasticity caused by single genes? 


ARISING FROM C. K. Ghalambor et a/. Nature 525, 372-375 (2015); doi:10.1038/nature15256 


Whether phenotypic plasticity generally facilitates or hampers adaptive 
evolution is a matter of much debate. By comparing gene expression 
changes in guppy populations, Ghalambor et al.' suggest that ‘adaptive 
plasticity hampers adaptive evolution, whereas ‘non-adaptive plasticity 
facilitates adaptive evolution’. Here, we argue that the classification of 
individual gene expression changes as representing either adaptive or 
non-adaptive plasticity is problematic. Instead of indicating adaptive 
evolution, the expression changes of suites of genes may be caused by 
potentially random changes in underlying regulatory processes. There 
is a Reply to this Comment by Ghalambor, C. K. et al. Nature 555, 
http://dx.doi.org/10.1038/nature25497 (2018). 

Ghalambor et al.' did not directly measure organismal plasticity or 
evolutionary adaptation at the organismal level’, but inferred evolu- 
tionary changes in plasticity from transcriptomic data (gene expression 
profiles). They first selected genes in the derived fish populations that 
are concordantly differentially expressed with respect to the ancestral 
population, thereby accounting for possible non-independencies in 
their expression data. They then continued their analysis by evaluat- 
ing the selected genes one by one, thereby assuming that each gene is 
an independent realization of plasticity and that all convergent gene 
expression changes are adaptive. Yet, genes do not act in isolation to 
shape the organismal phenotype, but are part of regulatory networks*". 
A change in the expression of one gene can propagate through the 
entire network, thereby affecting the expression of many downstream 
regulatory genes, many of which might have no consequences for the 
organismal phenotype. Moreover, changes in the expression of one gene 
strongly depend on the action of other genes’. As a result, divergent 
changes at the gene expression level can be associated with convergent 
changes (or lack of change) at the level of organismal plasticity; whereas 
convergent gene expression changes do not exclude diversity in organ- 
ismal response patterns. To illustrate this, we ran replicate evolutionary 
simulations, in which a simple gene regulatory network was selected to 
launch an adequate organismal response to the presence and absence 
of predators. In each simulation, the same optimal response readily 
evolved, but the network structures mediating this response differed 
substantially across replicate simulation runs (Fig. 1). Thus, without 
accounting for regulatory interactions, it is often impossible to infer 
the organismal consequences of gene expression changes and, hence, 
their adaptive value. 

By stating that selection acts “more strongly to decrease plasticity 
in those transcripts showing the greatest non-adaptive plasticity,” 
Ghalambor and colleagues’ suggest that individual genes can be clas- 
sified as expressing adaptive or non-adaptive plasticity, and that the 
expression changes of each gene between the ancestral and derived 
population resulted from adaptive evolution. This gives the false 
impression that the observed expression changes of many genes can 
be regarded as a large number of independent data points, whereas 
a single regulatory change could generate the same expression pro- 
files. We illustrate this by a simple model that demonstrates how a 
single regulatory change can yield the same results as those shown by 
Ghalambor et al.' without indicating adaptive evolution. To motivate 
this model, we need to have a closer look at their classification of adap- 
tive and non-adaptive plasticity. In their experiment, Ghalambor et al.! 
transplanted Trinidadian guppies (Poecilia reticulata) from a habitat 
with predators to two predatory-free habitats. After a few generations, 
fish from all three locations were reared in the laboratory to examine 
the degree of evolutionary divergence and the plastic responses of fish 


towards the presence and absence of predators. Genes in the ancestral 
population were classified as expressing adaptive plasticity when, in a 
predator-free environment, they exhibit an up- or downregulation in 
the direction of gene expression in the derived populations in this envi- 
ronment, but only if the degree of regulation is weaker in the ancestral 
population. Thus, plastic responses that occur in the same direction in 
both ancestral and derived populations but have a smaller effect size in 
the derived fish are considered to be non-adaptive. According to this 
definition, any factor that weakens the change in gene expression in 
the derived populations would be a source of non-adaptive plasticity. 
Let us assume, for example, that a large set of genes is under (partial) 
control of a regulatory mechanism, which affects the expression of 
these genes in response to different environments. An example is the 
hormone cortisol, which in guppies typically has higher levels in the 
presence than in the absence of predators®. Cortisol can have a major 
influence on gene expression? and it affects organismal phenotypes in 
a variety of ways’. The simplest model for the action of a hormone 
on many genes posits that the expression level of gene i is given by 
G;=B; + w; x H(E) + ¢, in which B; is the baseline expression level of 
this gene, H(E) is the environment-specific level of the hormone, and 
€ subsumes all stochastic effects. The effect w; of the hormone on gene 


Regulatory genes 


Environmental cues 
Phenotypic traits 


Figure 1 | Alternative gene regulatory network with identical plastic 
responses. Networks were selected to show an adequate phenotypic 
response to the presence (cues: E; = 1.5, E, = 2.2, E;=0.3) or absence of 
predators (cues: E, = 1, E,=2, E;=0.5). The optimal response is to express 
three different phenotypic traits as P; = P, =0, P;= 1 in the presence 

of predators, and as P, = P= 1, P;=0 in the absence of predators. The 
networks shown are the result of six replicate evolutionary simulations; 
all networks produce the same optimal phenotypic response, but the 
stimulating (green) and inhibitory (red) interactions within the networks 
and their strength (thickness) differ considerably. In accordance with 
previous models”, we used a gene regulatory network implementation 
with Boolean gene expression and continuous connection weights. 
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Figure 2 | Effect of small hormonal differences on gene expression 
patterns. a, b, In an ancestral and a derived population, the hormone- 
modulated expression pattern of 500 genes was assumed to follow the 
model G;= B; + w; x H(E) + « (see text). All parameters were the same 
in both populations, with the exception of hormone levels in a predator- 
free environment (ancestral population: H = 1.6; derived population: 
H=1.3). Conducting the same analyses as in Fig. 2 and 3 of ref. 1 (panels 
a and b, respectively) produces very similar patterns (of high statistical 
significance) as those reported by Ghalambor et al.’ Parameters: baseline 
levels (which do not affect the results) B; = 0.0; H=0.0 in the presence of 
predators; ¢ normally distributed with mean of 0 and s.d. of 0.1. 


ican be positive or negative and large or small (in our simulation, the 
w; effects are drawn from a standard normal distribution). If we now 
assume that fish from a derived population have a slightly lower hor- 
mone level when reared in the absence of a predator than fish from the 
ancestral population, this simple model reproduces the main results 
of Ghalambor et al.' (compare figures 2 and 3 from ref. 1 to Fig. 2a, b). 
Hence, a single regulatory change provides a much more parsimonious 
explanation for the observed transcription patterns than the large-scale 
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evolutionary changes postulated by Ghalambor et al.' Our alternative 
explanation does not refer to adaptive or non-adaptive plasticity, and 
it does not require evolution of gene regulation patterns (the w; effects 
remain constant). It does not even require adaptive evolution of the 
hormone level, because the lowered hormone level of the derived pop- 
ulations in a predator-free environment might just reflect a random 
change, for example, one caused by a founder effect!!, 

We conclude that the evolutionary analysis of transcriptomics data 
remains a major challenge. Ghalambor et al.' present an intriguing 
experiment to investigate the role of phenotypic plasticity on evolu- 
tionary adaptation, in which they show that gene expression patterns 
can rapidly diverge between independently evolving populations. We 
do not question this rapid divergence or that non-adaptive plasticity 
could potentially underlie this divergence, instead we want to point 
out that one should be cautious when inferring evolutionary changes 
in plasticity from transcriptomic data, especially when the underlying 
regulatory mechanisms are unknown. Because many loci are involved, 
the rapid divergence in gene expression found by Ghalambor et al.' 
probably reflects only a few regulatory changes, making it impossible to 
classify individual genes as expressing either adaptive or non-adaptive 
plasticity. Thus, owing to the complex relation between an organism's 
genotype and phenotype, it is hard to understand adaptive evolution 
by focusing on single genes alone. 

Data Availability All data are available from the corresponding author 
upon reasonable request. 


J. van Gestel+:?3 & F. J. Weissing?4 

1Groningen Institute for Evolutionary Life Sciences, University of 
Groningen, 9747 AG, Groningen, The Netherlands. 

email: f.j.weissing@rug.nl 

Department of Evolutionary Biology and Environmental Studies, 
University of ZUrich, 8057 Zurich, Switzerland. 

3Department of Environmental Systems Science, ETH Zurich, 8092 
Zurich, Switzerland. 

4Netherlands Institute for Advanced Study, 1012 CG Amsterdam, 
The Netherlands. 


Received 21 October 2015; accepted 29 November 2017. 


1. Ghalambor, C. K. et a/. Non-adaptive plasticity potentiates rapid adaptive 
evolution of gene expression in nature. Nature 525, 372-375 (2015). 

2. Merila, J. Evolutionary biology: perplexing effects of phenotypic plasticity. 
Nature 525, 326-327 (2015). 

3. Segré, D., Deluna, A., Church, G. M. & Kishony, R. Modular epistasis in yeast 
metabolism. Nat. Genet. 37, 77-83 (2005). 

4. Davidson, E. H. Emerging properties of animal gene regulatory networks. 
Nature 468, 911-920 (2010). 

5. Wagner, A. Robustness and Evolvability in Living Systems (Princeton Univ. Press, 
2007). 

6. Wagner, A. The Origins of Evolutionary Innovations (Oxford Univ. Press, 2011). 

7. Beukeboom, L & Perrin, N. The Evolution of Sex Determination (Oxford Univ. 
Press, 2015). 

8. Fischer, E. K., Harris, R. M., Hofmann, H. A. & Hoke, K. L. Predator exposure 
alters stress physiology in guppies across timescales. Horm. Behav. 65, 
165-172 (2014). 

9. Aluru, N. & Vijayan, M. M. Stress transcriptomics in fish: a role for genomic 
cortisol signaling. Gen. Comp. Endocrinol. 164, 142-150 (2009). 

10. Becker, J. B., Breedlove, S. M., Crews, D & McCarthy, M. M. Behavioral 
Endocrinology 2nd edn (MIT Press, 2002). 

11. Bonier, F., Martin, P. R., Moore, |. T. & Wingfield, J. C. Do baseline glucocorticoids 
predict fitness? Trends Ecol. Evol. 24, 634-642 (2009). 

12. Spirov, A. & Holloway, D. Using evolutionary computations to understand the 
design and evolution of gene and cell regulatory networks. Methods 62, 39-55 
(2013). 


Author Contributions J.v.G. and F.J.W. both contributed to all aspects of this work. 
Competing Financial Interests Declared none. 


doi: 10.1038/nature25495 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


BRIEF COMMUNICATIONS ARISING 


Contesting the evidence for non-adaptive 


plasticity 


ARISING FROM C. K. Ghalambor et a/. Nature 525, 372-375 (2015); doi:10.1038/nature15256 


The effect of phenotypic plasticity on the evolution of traits is a highly 
controversial subject. Ghalambor et al.' added a new spin to this debate 
as they contrasted gene expression patterns of guppy populations 
evolved at two different predation levels. After four generations 
in a new environment, gene expression plasticity evolved with two 
interesting key features: (1) in environments without predation, gene 
expression in the evolved populations changed in a direction that 
decreases ancestral plasticity; and (2) the ancestral level of plasticity 
and the magnitude of change in plasticity are negatively correlated. 
However, this pattern could be an artefact of the analysis procedure 
rather than reflecting selection, and simple computer simulations that 
assume no divergence in gene expression between populations and only 
stochastic variation are able to replicate the pattern. There is a Reply 
to this Comment by Ghalambor, C. K. et al. Nature 555, http://dx.doi. 
org/10.1038/nature25497 (2018). 

The analysis of genome scale data in evolutionary experiments is 
very challenging because it is important to distinguish genetic drift 
from directional forces (selection). Ghalambor et al.' approach this 
problem in two steps. First, they identify genes that are differentially 
expressed between ancestral and evolved populations in the new 
predator-free environment. Second, they condition on the average 
effect of the two evolved populations to go in the same direction as 
for one additional population that naturally colonized a predator-free 
environment. This step is included to distinguish between genetic 
drift and selection, because only selection is directional. Although we 
agree with the rationale of the testing procedure, we caution that the 
implementation could generate a signal of ‘concordantly differentially 
expressed (CDE)’ genes, even in the absence of selection. 

Assuming no genetically based differences in gene expression and 
random differences in gene expression following a normal distribution, 
we simulated 37,493 transcripts and applied the testing procedure of 
Ghalambor et al.' We show that CDE transcripts can be detected even 
in the absence of selection. As in ref. 1, for CDE transcripts we also 
detect a negative correlation between ancestral plasticity and divergence 
in gene expression (Fig. la). 

This negative correlation is the result of a statistical phenomenon 
known as the regression towards the mean (RTM), which was first 
described by Sir Francis Galton?: he observed that tall parents have, on 
average, children that are smaller than them (and vice versa). In this 
example, the parents at the extreme end of a distribution have children 
whose heights are closer to the mean of the distribution: that is, they 
regress towards the mean. 

In our simulations, all transcripts have the same distribution in both 
environments. CDE transcripts have by chance diverged from this 
mean and thus were kept for further analysis. Yet, when contrasted to 
their expression in the second environment, they will probably regress 
towards the mean. Here, RTM creates the negative correlation as 
illustrated in Fig. 2: because three independent means are required for 
one group, but only one for the other, we expect that the first group 
will have values closer to the mean than the latter one, creating an 
asymmetry. Because random draws of the ancestral population under 
predation conditions are expected to regress towards the mean, the 
initial conditioning of the CDE transcripts causes the observed negative 
correlation (Fig. 2). Using the CDE transcripts, we also reproduce the 


negative correlation between the magnitude of plasticity in the source 
population and the evolved mean change in plasticity (Fig. 1b). As in 
Fig. la, we think that this pattern can be explained by the conditioning 
in the analysis. The more extreme the expression of the ancestral 
population at low predator conditions is, the less deviation from the 
mean is needed for the evolved populations to be significant in the first 
step of the analysis. Hence, drawing a random sample (for the predator 
environment) will result in small differences between the expression 
values for the evolved populations. 
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Figure 1 | Negative correlations between ancestral plasticity and 
divergence in gene expression, and between the magnitude of plasticity 
in the source population and the evolved mean change in plasticity. 

a, b, Negative correlation of the ancestral plasticity with the divergence 

in gene expression (a) and negative correlation of the mean change in 
plasticity in the introduced population with magnitude of plasticity in the 
source population (b) for CDE transcripts assuming no genetic divergence 
among all populations. Both insets show the distribution of Spearman's 
rank correlation for the CDE transcripts from 1,000 random permutations 
of the CDE transcripts. The correlation in the originally simulated (that is, 
non-permuted) dataset (red vertical line) is always more negative than in 
the permuted dataset. 
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Figure 2 | Schematic representation of the negative correlation 

between the changes in gene expression observed in the predator-free 
environment and the ancestral plasticity when all values are randomly 
sampled from a normal distribution. The analysis of Ghalambor et al.’ 
identifies genes expressed at one end of the distribution in the ancestral 
population, and closer to the mean for the three evolved ones (1). As the 
values in the predator environment are independent, they are also grouped 
around the mean and correlate negatively (2) with the evolved response: 
they regress towards the mean. 


We showed that the key results of Ghalambor et al.' could be obtained 
without any genetically based difference in mean expression between 
the populations. Nevertheless, the results of our simulations differ in 
one aspect from the original study—the number of CDE transcripts 
in the simulated data are identical to the ones in the permuted data 
(all are randomly sampled). Reasoning that small differences due to 
sampling or even directional selection could explain this discrepancy, we 
performed additional simulations assuming small random changes in 
mean expression between populations. We confirmed that the number 
of CDE transcripts is higher than in the permuted dataset, exhibiting 
the same negative correlations without selection on plasticity 
(see Supplementary Information). The effect of RTM decreases with 
increasing between-population divergence relative to within-population 
variation’. For the data of Ghalambor et al.!, the effect of RTM is 
difficult to quantify because each transcript has specific mean expression 
values and within and between population variances. Hence, 
we contrast the results of computer simulations using different 
parameter combinations to the experimental data (Supplementary Fig. 4). 
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The experimental data from Ghalambor et al.! were most similar to 
computer simulations in which the between-population divergence 
estimator was lower than the within-population variation; that is, the 
simulation data that show the RTM effect. 

In any case, our simulations were not designed to match the experi- 
mental details of Ghalambor et al.' but instead to highlight that 
even without evolution of gene expression plasticity, we are able to 
recapitulate the key results of their study. Thus, we do not claim that 
the guppies have not evolved their plasticity of gene expression, but 
we feel that the authors would need to provide additional analyses to 
support their conclusions. 


Methods 

We randomly sampled gene expression values from a normal distribution for 
37,943 transcripts in 32 samples. For each population comparison, we computed 
Student's t-test on 250 randomly permuted datasets and identified transcripts for 
which the simulated data were located in the 5% tails of the ¢-statistic distribution. 
We then calculated the mean differences in simulated expression between 
treatments and populations to determine the correlations between ancestral 
plasticity and divergence in gene expression (Fig. 1a) and magnitude of plasticity 
in the source population and evolved mean change in plasticity within introduced 
populations (Fig. 1a). 

Code availability. The R code for the simulations are provided in the 
Supplementary Information. 
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Ghalambor et al. reply 


REPLYING TO J. van Gestel & F. J. Weissing Nature 555, http://dx.doi.org/10.1038/nature25495 (2018); F. Mallard, A. M. Jaksié & C. Schlétterer 


Nature 555, http://dx.doi.org/10.1038/nature25496 (2018) 


The concerns raised in the accompanying Comments by Mallard et al.! 
and van Gestel and Weissing? that natural selection could not generate 
the transcriptomic results reported in our Letter’ overlook the larger 
context of previous work documenting rapid parallel evolutionary 
changes in guppies*-’. Here we also show compelling evidence that 
their alternative interpretations simply do not match our published 
datasets. 

Mallard et al. argue that the negative correlation between plasticity 
and evolution reflects neutral processes rather than selection, but they 
only report simulations using a subset of criteria our conclusions were 
based on; when all criteria are applied, their simulations support our 
conclusions. We based our conclusions on four criteria with analyses 
that appropriately accounted for stochastic variation and non- 
independence of our data: (1) we found more concordantly differen- 
tially expressed (CDE) transcripts than in permuted datasets; (2) we did 
not find more genes that diverged in opposite directions than in per- 
muted datasets; (3) the association between plasticity and divergence 
was more negative than in permuted datasets; and (4) the direction of 
association between plasticity and divergence was more extreme than 
in permuted datasets. Had our data not met both of the first two crite- 
ria, we would not have concluded that our CDE genes were enriched 
for genes evolving under selection and would not have proceeded to 
assess the relationship between ancestral plasticity and divergence. The 
results presented in figures 1 and 2 of Mallard et al.' do not meet cri- 
terion (1) and thus provide no evidence that weakens our conclusions. 
The authors acknowledge this potential flaw and present simulations 
in their supplementary methods that meet criterion (1), yet they omit 
criterion (2) in those analyses. In fact, only 26 of their 400 parameter 
sets replicate all four criteria, and only a fraction of these yield datasets 
with distributions that might reasonably match our results (Fig. 1 and 
ref. 8). Therefore, the simulations of Mallard et al.' are not only 


a Parameters meet criterion (2) b Parameters meet criteria (1)-(4) 
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Figure 1 | A small fraction of simulations performed by Mallard 

et al.' meet all four criteria that support our conclusion that CDE 

genes diverged under selection, and that they diverged in the opposite 
direction as ancestral plasticity. Blue boxes indicate parameter sets that 
met the criteria, and red boxes indicate parameter sets that did not meet 
the criteria. a, Only a subset of parameters meet criterion (2), positing that 
differentially expressed genes that diverge in opposite directions should not 
be overrepresented compared to the permuted datasets. b, All four criteria 
merged together shows that only 26 parameter sets recreate the results found 
in Ghalambor et al.* This contrasts with the much higher fraction claimed 
by Mallard et al.', as they did not consider criterion (2) in their analyses. 


consistent with our interpretation, but also other studies concluding 
that rapid evolution of genes exhibiting non-adaptive plasticity is more 
likely due to selection and very unlikely to arise by chance*”’. 

We agree with van Gestel and Weissing” that individual transcripts 
are not independent, and recognize the robustness of gene expression 
networks to produce similar phenotypes via different mechanisms”. 
We explicitly incorporated non-independence among transcripts in 
designing our permutations to preserve entire transcriptional profiles. 
The model proposed by van Gestel and Weissing” positing a single 
regulatory change underlying patterns in our set of CDE transcripts 
simply does not match our data, as such a model would generate a 
strongly correlated set of CDE transcripts. Correlational analyses show 
that few of our CDE genes are strongly correlated, and that numerous 
clusters of transcripts independently evolved CDE expression patterns 
and negative associations between plasticity and divergence (see ref. 8) 
to compare correlations in experimental data and simulated datasets 
with a single regulatory change). We thus refute the claim that variation 
in a single modulator can account for the rapid evolution of our CDE 
transcripts, just as it is unlikely to explain the rapid parallel evolution 
of other complex phenotypes in guppies. 
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NEWS & VIEWS 


PLANETARY SCIENCE 


Oceans on Mars formed early 


The geometry of putative ancient shorelines on Mars suggests that these features were deformed by the growth of a massive 
volcanic region — a finding that has implications for the climate, geology and hydrology of early Mars. SEE LETTER P.643 


MARIA T. ZUBER 


rguably the most compelling evi- 
Aw that Mars once had oceans is 

the presence of potential shorelines in 
the planet’s northern hemisphere’. However, 
these features do not have a constant elevation, 
which would argue against an oceanic origin’. 
On page 643, Citron et al.’ show that the devia- 
tions can be explained if the shorelines formed 
before and during the growth of Tharsis — a 
volcanic region that spans nearly one-quarter 
of Mars’s surface area. The results imply that 
the formation of the planet’s oceans occurred 
earlier than previously thought, and might 
be linked to the volcanism of Tharsis and the 
formation of valley networks*. 

The history of liquid water on Mars is 
of great interest because water would have 
strongly influenced the planet’s climate and its 
potential for habitability. Evidence for a watery 
past in Mars’s geological record is abundant, 
but many of the data are indicative of cata- 
strophic outflows or intermittent releases that 
neither require a temperate surface environ- 
ment, nor are thought to have persisted long 
enough for the development of microbial life’. 

Researchers have long debated the possibil- 
ity that a liquid-water ocean or oceans once 
filled the topographic depression in Mars’s 
northern hemisphere®. Comparisons of the 
roughness’ and intrinsic electrical proper- 
ties (dielectric properties)* of the proposed 
Martian sea floor with those of similarly 
sediment-laden surfaces on Earth, as well 
as the concentrations of certain isotopes 
on Mars’, support the idea of past oceans. 
But perhaps the most convincing evidence 
comes from geomorphological observations 
of potential ancient shorelines. In particu- 
lar, geological mapping of spacecraft images 
has revealed putative shorelines that can be 
tracked for thousands of kilometres on Mars’s 
northern plains’. 

The identification of these shorelines is not 
without caveats. For instance, the geomorpho- 
logical observations are open to interpreta- 
tion’. Furthermore, shorelines, like beaches on 
Earth, should trace surfaces of constant gravi- 
tational potential (such as sea levels). However, 
Mars’ putative shorelines show changes in ele- 
vation of up to several kilometres, indicating 
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Figure 1 | Mars’s topography and putative ancient shorelines. The surface of Mars contains a massive 
region called Tharsis (red), which was formed by volcanic activity, and lowland areas in the northern 
hemisphere (blue) that have been proposed to be the location of ancient oceans’. Geological mapping’ 
has revealed two extensive potential past shorelines: Arabia (dashed line) and Deuteronilus (solid lines). 
Citron et al.* report that the geometry of these shorelines can be explained if they were deformed in 
response to the growth of Tharsis, rather than after Tharsis’s formation as previously assumed. The colour 
scale denotes elevation in kilometres. (Adapted from Extended Data Fig. 2 of the paper’.) 


that if these features indeed formed at ocean 
coasts, they subsequently underwent substan- 
tial deformation. If the event that produced 
such deformation could be identified, it would 
constrain the relative timing of the formation 
of the shorelines, and therefore of the oceans. 
Citron and colleagues use shoreline model- 
ling to show that the observed deformation can 
be explained by the growth of Tharsis, most of 
which occurred some 3.6 billion or more years 
ago, during an early stage in Mars’s history 
called the Noachian period. The authors con- 
sidered two extensive shorelines: Arabia and 
Deuteronilus (Fig. 1). Their analysis indicates 
that the Arabia shoreline formed before or in 
the early stages of Tharsis’s growth, whereas 
the Deuteronilus shoreline arose during the 
late stages. These findings are in contrast 
to general thinking that the oceans, if they 
existed, formed after Tharsis’s emplacement. 
By correcting for deformation events, Citron 
et al. place lower limits on the ocean volumes 
associated with the Arabia and Deuteronilus 
shorelines, of about 11 and 3 times that of 
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Earth’s Mediterranean Sea, respectively. 

A previous study’® showed that the eleva- 
tion patterns of the Arabia and Deuteronilus 
shorelines could be explained by the deforma- 
tion induced by a reorientation of Mars’s spin 
axis — a phenomenon known as true polar 
wander"'. Rotational dynamics dictates that a 
planet spins stably about its axis of minimum 
inertia. Consequently, the load associated with 
an ocean of the calculated volume could have 
resulted in true polar wander if Tharsis had 
formed far from Mars’s equator. However, 
evidence now indicates that Tharsis prob- 
ably originated near the equator'””, in which 
case, ocean loading would not have been great 
enough to destabilize the planet’s spin axis. If 
Tharsis had formed close to, but not precisely 
at, the equator, loading would have induced 
a limited amount of true polar wander, but 
not enough to explain the deformation of the 
proposed shorelines. 

Citron and colleagues’ scenario is there- 
fore more consistent with our current 
understanding of Tharsis’s emplacement. The 


authors’ revised sequence of events also has 
some appeal, in that it ties ocean formation 
to the release of water-rich gas from massive, 
extended volcanic events. And, in the case of 
the ocean associated with the Arabia shoreline, 
it links ocean formation to the emergence of 
valley networks* — abundant features on early 
Mars that carried water to the northern low- 
lands in which the oceans were situated. 

Not surprisingly, the interplay of water 
and geodynamics over Mars’ history is more 
complicated than presented in Citron and col- 
leagues’ analysis. During the period of interest, 
additional deformational processes occurred 
at local to global length scales that were not 
accounted for. Such processes are associated 
with, for example, mantle activity, impacts of 
asteroids or comets on to Mars’s surface, gla- 
ciers, erosion and individual volcano growth, 
and could also have contributed to the defor- 
mation of the shorelines. 

More-detailed mapping and modelling 
could clarify the contributions to the observed 
elevation changes and refine the sequence and 
relative timing of events. In addition, other, 
much shorter and more fragmented potential 
shorelines have been proposed'*”* that suggest 
the existence of seas and lakes, and that merit 
further study. Finally, the timing and role of 
volcanism, as it relates to the state and evolu- 
tion of the atmosphere and to the persistence 
of water on the planetary surface, is far from 
understood. But, fortunately, the extensive 
database of remote and in situ observations of 
Mars continues to accumulate, revealing the 
intriguing story of Mars’s water-rich past. m 
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A trio of ion channels 
takes the heat 


Of the various temperature-sensitive ion channels identified previously, 
three have now been found to act in concert to detect painful heat and initiate 


protective reflexes. SEE LETTER P.662 


ROSE Z. HILL & DIANA M. BAUTISTA 


here's nothing cosier than sitting by 

the fire on a cold night, and mammals 

have an amazing ability to find the per- 
fect spot in which to warm themselves with- 
out being burnt. This ability is mediated by 
specialized heat-sensitive neurons of the 
somatosensory system that innervate the 
skin. These neurons are fine-tuned to detect 
the temperature at which pleasurable warmth 
turns into painful (noxious) heat, and are 
responsible for initiating protective reflexes and 
triggering avoidance behaviours. The search for 
temperature sensors in somatosensory neurons 
has uncovered several ion channels that are 
part of the transient receptor potential (TRP) 
family’, but the TRP channel or channels 
that are responsible for heat-evoked pain have 
remained unclear. Vandewauw et al.” report 
on page 662 that three TRP channels work 
together to detect noxious heat in mice. 

The first temperature-sensitive TRP 
channel, TRPV1, was discovered more than 
20 years ago’. TRPV1 hasa crucial role in the 
development of hypersensitivity to heat after 
injury or inflammation, but experiments have 
revealed that mice that lack TRPV1 display 
only a partial defect in the ability to sense 
and respond to noxious heat®. Another TRP 
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channel in somatosensory neurons, TRPM3, 
is also activated in response to painful tem- 
peratures; however, heat-evoked pain-avoid- 
ance behaviours are only partially attenuated 
in mice that lack this channel’, leaving the 
mystery of how painful heat is sensed 
unsolved. 

Vandewauw et al. report that double-mutant 
mice, which lack both the TRPV1 and TRPM3 
channels, show only mild defects in heat- 
evoked pain-avoidance behaviours, similar 
to mice that lack just one of those channels. 
By contrast, they find that mice that lack a 
third channel, TRPA1, as well as TRPV1 and 
TRPM3, do not display any protective avoid- 
ance behaviours when exposed to noxious 
heat (Fig. 1). The three channels therefore act 
in concert to mediate behavioural responses 
to noxious heat. 

What led Vandewauw and colleagues to 
implicate TRPA 1 in heat sensing? The authors 
observed that a subset of the heat-responsive 
somatosensory neurons that normally express 
TRPV1 and TRPM3 also express TRPAI, 
which has previously been shown to medi- 
ate inflammatory pain’ and itching’ in mice. 
TRPA1 is known to contribute to the heat- 
sensing abilities of non-mammalian organ- 
isms, including rattlesnakes? and fruit flies!”, 
but mice that lack TRPA1 exhibit normal 
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Figure 1 | Three heat-sensitive TRP channels. Vandewauw et al.’ demonstrate in mice that potentially 
damaging temperatures excite pain-sensing neurons of the peripheral nervous system through the 
activity of three ion channels from the TRP family — TRPV1, TRPM3 and TRPA1. Activation of TRPA1 
(by, for example, hydrogen peroxide, H,O,) is likely to require oxidation of certain amino-acid residues, as 
well as the presence of heat. Activation of the three channels causes neuronal firing and the transmission 
of signals to the central nervous system, leading to pain-associated behaviours and protective reflexes. 
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avoidance of noxious heat”"!. Therefore, the 
protein has not been implicated in heat sensing 
in mammals — until now. 

Activation of this channel by heat in rattle- 
snakes or fruit flies involves specialized 
sequences of amino acids known as ankyrin 
repeats at the amino terminus’’. These repeti- 
tive stretches are different in mammalian 
TRPA1, making the channel insensitive to 
heat under normal conditions”. By contrast, 
activation of TRPA1 in mice by inflammatory 
molecules is dependent on the oxidation state 
of three cysteine amino-acid residues located 
adjacent to the ankyrin repeats’. Vandewauw 
and colleagues therefore proposed that oxi- 
dation of these residues might be sufficient 
to activate the heat-sensing capabilities of 
mouse TRPA1. 

To trigger oxidation of TRPA1 in vitro, the 
authors applied hydrogen peroxide to sensory 
neurons that lacked TRPV1 and TRPM3 but 
expressed TRPAI. They found that hydrogen 
peroxide treatment did sensitize TRPA1 in 
such a way that heat could now excite the neu- 
rons. This suggests that mouse TRPA1 must 
be oxidized to respond to heat. However, the 
authors do not provide a physiological mecha- 
nism for how TRPA1 might be oxidized in vivo 
to participate in heat sensing. This will no 
doubt be an area for future investigation. 

Vandewauw and colleagues also performed 
electrophysiological experiments, which 
revealed that the three TRP channels they 
studied are co-expressed in more than one 
class of somatosensory nerve fibre — including 
at least two types of pain-sensing neuron that 
encode distinct sensations, such as dull pain or 
sharp pain. The expression of these channels 
in different neuronal subtypes might give rise 
to complex circuitry and crosstalk that aids the 
rapid avoidance of noxious heat. In addition, it 
could add a further layer of redundancy to an 
important protective response. 

Finally, the group showed that, although 
the triple-mutant mice do not exhibit pro- 
tective heat-avoidance reflexes, they do 
have the same preference as control mice for 
innocuous warm temperatures (30°C) over 
noxious high temperatures (45 °C) when 
presented with a choice. This finding sug- 
gests that the sensing of painful heat might 
not have a substantial role in determining a 
preference for pleasurable temperatures over 
painful ones, warranting further investiga- 
tion of warmth sensing — an area of active 
debate. A previous study has shown that 
whereas heat-responsive spinal neurons, 
which receive input from heat-sensitive 
somatosensory neurons, respond to abso- 
lute temperatures, cold-responsive spinal 
neurons, which receive input from cold- 
sensitive somatosensory neurons, respond to 
relative changes in temperature”. Together, 
these data imply that the main role of periph- 
eral heat-sensing neurons, such as those 
described by Vandewauw et al., might be 


to mediate avoidance of high temperatures, 
rather than to set an animal’s preferred 
temperature range. 

Circuit-tracing and in vivo imaging will no 
doubt reveal the contributions of different 
subsets of peripheral neurons to the various 
representations of cold, cool, warm and hot 
temperatures in the central nervous system. 
Decoding these circuits will help to unravel 
the nuances of sensory perception and to 
elucidate the basis for diverse temperature 
preference between species or individual 
organisms. 
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AI designs organic 


syntheses 


Software that devises effective schemes for synthetic chemistry has depended 
on the input of rules from researchers. A system is now reported in which an 
artificial-intelligence program learns the rules for itself. SEE ARTICLE P.604 


DEREK LOWE 


ynthetic organic chemistry is the science 

of building desired chemical structures 

from simpler parts. The knowledge and 
experience of researchers has always been 
the key to combining chemical reactions 
into successful synthetic schemes. But on 
page 604, Segler et al.’ report that an artificial- 
intelligence program can design routes for 
synthesizing compounds that — at least, on 
paper — seem just as good as those produced 
by humans. 

Organic chemists often work by thinking 
backwards as much as they do forwards when 
designing a synthetic route. The concept of 
retrosynthesis’, introduced by E. J. Corey in the 
1960s, and for which he was awarded the Nobel 
Prize in Chemistry in 1990, codified the way 
in which many chemists think (Fig. 1). When 
looking at a target molecule, they ask: “What 
could this have been made from? Which bonds 
could have been formed, and which atoms or 
chemical groups could have been added or 
transformed?” Then, the process starts again, 
as researchers try to determine the reactions 
that could have led to the precursor molecule. 
The aim is to work back to easily available 
starting compounds, while balancing the 


592 | NATURE | VOL 555 | 29 MARCH 2018 
© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


factors that make a good synthesis — including 
the number of steps involved, the probable 
product yields of those steps, and the ease of 
use of the chemistry involved. Organic chem- 
ists deal constantly with such questions, for 
example when making compounds for testing 
in drug-discovery programmes. 

Since the birth of synthetic organic 
chemistry in the mid-nineteenth century, a 
huge number of synthetic organic reactions 
have been reported across a literature that 
gets larger every hour. Before the 1980s, many 
chemists kept collections of handwritten, 
cross-referenced index cards containing use- 
ful reactions from the literature, to guide the 
design of synthetic pathways. These aide- 
memoires moved naturally on to digital 
databases as computer technology became 
widespread. 

These days, chemists review the various 
methods for turning chemical group X into 
chemical group Y by drawing the molecular 
structures of interest using a computer pro- 
gram and then performing an online search for 
relevant reactions. This almost invariably pro- 
duces a long list, from which researchers must 
select the most appropriate reaction for their 
needs, according to their knowledge and expe- 
rience. Stringing such reactions into a useful 
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Figure 1 | Retrosynthetic analysis. Chemists use a strategy known as retrosynthesis to design schemes 
for constructing organic molecules. a, The first step is to work out a theoretical pathway by which a 

target molecule is sequentially converted into intermediates that can be easily prepared or purchased. 

In this example, the red bonds and atoms in the target can be derived from the red double bond in the 
intermediate molecule. The red double bond and the blue bonds in the intermediate can, in turn, be 
formed ina single step from two commercially available starting materials (Et, ethyl group). b, The 
second step is to choose reactions that will convert the starting materials into the target molecule: here, 
the Diels—Alder reaction forms the intermediate, which undergoes metal-catalysed oxidation to form the 
target. Segler et al.' report that an artificial-intelligence program has devised plausible synthetic routes for 


molecules by inferring design rules on its own. 


synthesis has been thought of as a problem that 
only humans can solve. 

But does it have to be? Could a sufficiently 
large and well-curated database of chemical 
transformations be used as the basis for a pro- 
gram that not only finds reactions, but also 
arranges them into plausible synthetic plans? 
Such programs have been sought since Corey’s 
work in the 1960s, but (until recently) with 
little practical success. 

Two fundamental problems have frustrated 
the dream. First, computing hardware simply 
could not tackle the scale of the challenge. 
Second, the chemical literature is hard to 
define in terms that a software program can 
understand: given reactions would work for 
the type of compound for which they were 
claimed to work (most of the time), but 
only under certain conditions. For exam- 
ple, group X would turn into group Y, unless 
group Z was present elsewhere in the reactant 
molecules. When group Z was present, the 
reaction might still work if group Q was nearby 
in the same molecule — but only, for instance, 
when the pH was lower than a certain value, 
or when the temperature was high enough, or 
when there was no water present. 

There are various ways to overcome 
this second problem. One is to provide the 
program with an exhaustively human-curated 
list of the reactions that can enable a desired 
chemical transformation, and which takes into 
account all the limitations and conditions. The 
program can then combine such reactions into 
synthetic routes in ways that are broadly simi- 
lar to those used to evaluate combinations of 
chess moves. This approach is starting to yield 


results (see ref. 3, for example), and several 
competing commercial software products are 
available. 

Segler et al. have investigated another 
method: instead of getting researchers to load 
their expertise into a machine, is it possible 
to design a program that learns by itself what 
researchers know? This concept has already 
produced startling results, with programs 
that can learn to play 
games such as Go on 
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plausibility.” matically extracting 
chemical transfor- 
mations from a large commercial database, 
being careful to include only reactions that 
have been reported several times. Their sys- 
tem accepts these well-precedented reactions 
as ‘allowed moves in organic synthesis. When 
the system is asked to devise a synthetic route 
to a target molecule, it works backwards from 
the target as would a human, picking out the 
most promising precursor molecules accord- 
ing to the design rules that it has learnt, and 
then seeing how feasible it is to synthesize 
those. The authors combined three artificial 
neural networks with a random Monte Carlo 
tree search — a type of search algorithm used 
by computers in certain decision-making 
processes — to narrow down the most 
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promising synthetic routes, without getting 
stuck too quickly on a particular path. 

Importantly, the routes that emerged were 
evaluated not only by the program’s scoring 
system, but also by trained organic chemists 
ina blind test for plausibility. When the chem- 
ists were asked to assess machine-generated 
synthetic pathways for target molecules along- 
side routes reported in the literature, they 
expressed no preference for the routes that had 
been shown to work by their fellow research- 
ers. In other words, they found the chemistry 
suggested by the program to be as reasonable 
as the syntheses proposed by researchers. 

This does not necessarily mean that all 
machine-suggested routes will work in the 
laboratory — but, as organic chemists know 
to their sorrow, many routes designed by 
humans fail there, too. Further development 
of the program could include such ‘reduc- 
tions to practice, to determine whether the 
machine-proposed routes are better (or, at 
least, no worse) than those devised by people. 
A study’ this year that assessed a more conven- 
tional, hand-curated retrosynthesis program 
is notable for its inclusion of such a laboratory 
test. Achieving routes that are ‘no worse’ than 
those of researchers is a clear victory for Segler 
and colleagues’ program, which arrives at path- 
ways in considerably less time, and with much 
greater coverage of the literature, than a person 
could manage. 

If such programs fulfil their promise, and 
there is little reason to think that they won't, 
synthetic chemists will find that a mainstay 
of their work starts to disappear. Technologi- 
cal innovations have had similar effects in the 
past, but usually by automating physical ‘grunt 
work’ that is missed by no one. Disconcert- 
ingly, developments in artificial intelligence 
encroach on the thinking part of the job. There 
will always be complex, unusual and unprec- 
edented structures that such software cannot 
handle, but the task of solving more-routine 
synthetic questions will be taken out of the 
hands of researchers. 

The idea that intellectual tasks can be 
categorized as automatable grunt work will 
probably be insulting to many chemists, and 
will certainly feel like a threat. But the use 
of artificial intelligence will actually free up 
time in which they can think more about the 
higher-level questions of which molecules 
should be made and why, rather than focusing 
on the details of how to make molecules. Not 
all researchers will welcome this shift. But it 
seems to be coming, regardless. m 
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Wing origam1 


Origami-based design techniques have 
applications ranging from nanoscale devices 
to architectural structures. But conventional 
techniques have a limited range of folding 
patterns. Writing in Science, Faber et al. 
report an approach that overcomes this 
restriction VU. A. Faber et al. Science 359, 
1386-1391; 2018). 

The authors were inspired by the amazing 
folding abilities of earwig wings (pictured). They 
discovered that protein-rich joints in the wings 
behave like springs, and mimicked this design 
in synthetic structures using mechanical 
springs. Such structures show folding patterns 
and functionalities that were impossible using 
previously available methods. Ryan Wilkinson 


IMMUNOLOGY 


Killer T cells show 


their kinder 


side 


The immune system protects the body by responding to invading organisms. But 
how is an attack on useful resident microbes prevented? A pathway has now been 
identified that allows immune cells to sense and respond to beneficial bacteria. 


PAUL KLENERMAN & GRAHAM OGG 


r | Vhe immune system performs a difficult 
balancing act. It must respond rapidly 
to dangerous microorganisms that 

threaten the host, yet also coexist peacefully 
with the large array of microorganisms that 
colonize barrier surfaces such as those of the 
skin and gut. Writing in Cell, Linehan et al.' 
examine the interactions between immune 
cells and a bacterium that normally resides 
on the skin, and identify a signalling pathway 
that governs the immune response to such 
microbes. 

Most immunological research has focused 
on howimmune cells recognize and eliminate 
viral and bacterial pathogens, and the insights 
obtained have led to the development of many 
immune-based therapies. However, attention 
has also turned to microbial communities at 
barrier surfaces and the long-term influence 
they exert on host immune responses. 
Accordingly, Linehan and colleagues studied 
the skin-dwelling bacterium Staphylococcus 
epidermidis in mice and rhesus monkeys, as 
well as analysing human-tissue samples from 
the clinic. 

Staphylococcus epidermidis belongs 


to a group of microorganisms termed 
commensals, and usually colonizes human 
skin without causing disease. However, in 
premature newborns and people who are in an 
immunosuppressed state or who have received 
artificial joints, S. epidermidis can cause severe 
disease or chronic infection’. Furthermore, 
even though the bacterium is normally harm- 
less, it is not ignored by the immune system. 
In mouse skin, the response® to an initial 
encounter with certain strains of S. epidermidis 
is dominated by a type of immune cell called 
a cytotoxic CD8* T cell, also known as a killer 
T cell. Linehan and colleagues asked two key 
questions about this response, and the answers 
they obtained are illuminating. 

The first question was, how do T cells 
recognize S. epidermidis? A response to bacte- 
ria is thought to occur when a T cell recognizes 
a bacterial peptide fragment bound to the 
‘classical’ class Ia protein component of the 
major histocompatibility complex (MHC), 
which presents such fragments to immune 
cells. However, using a mouse model system, 
Linehan and colleagues found that a ‘non- 
classical’ protein, H2-M3, which belongs 
to the MHC class Ib family, was instead 
involved in MHC presentation in the response 
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to S. epidermidis. H2-M3 can present’ bacterial 
peptides in which the amino-acid residue 
methionine has been modified by addition of 
an N-formyl group. 

The other question Linehan and colleagues 
sought to answer was whether this non- 
classical immune-recognition pathway 
might also be coupled to ‘unconventional’ 
T-cell behaviour. The team investigated this 
by comparing gene-expression patterns of 
CD8* T cells responding to S. epidermidis 
with those of other types of T-cell population 
in the mouse skin that did not respond to the 
bacterium. The authors found that the respon- 
sive CD8* T cells expressed genes associated 
with wound healing and tissue repair (Fig. 1). 
This was surprising because responding CD8* 
T cells would be expected to express genes asso- 
ciated with activation of the pathways involved 
in killing unwanted cells or in the release of 
pro-inflammatory signalling molecules. 

Using an in vivo mouse model of skin- 
wound healing in association with S. epider- 
midis colonization, the authors demonstrated 
that the tissue-repair gene-expression 
signature in responding T cells was linked 
to promotion of a healing response. Thus, 
immune recognition of bacterial residents 
of the skin can induce a form of immunity 
that does not directly retaliate against the 
bacterium, but instead aids tissue repair. 

The authors’ finding adds to a growing 
recognition of the diverse functions of other 
unconventional T cells that are found at barrier 
sites and that can recognize evolutionarily 
conserved microbial molecules. These cells 
include, for example, mucosal-associated invar- 
iant T cells, which are abundant in humans and 
induced by commensals, and which share sev- 
eral features” of the T-cell population studied by 
Linehan and colleagues — such as the produc- 
tion of the signalling molecule interleukin-17A. 


PANTHER MEDIA GMBH/ALAMY 


a b 
Dendritic \ ~ Bacterial CD8*T cell \ A~ N-Formy| bacterial 
\ cell oy f antigen al a \\ _f antigen 
\ | ) a. 
— aA A XN 
wy 1 | Nee 
€ @ @ 
\( MHCla TCR 2° 8 ovtotoxic 
ZA @ ay 
= Cytokine nae ecu: —  Tissue-repair —o 
a CA factors re 
Pathogenic S. epidermidis 
bacterium commensal 


Bacterium killed 


bacterium 


Bacterium survives 


Figure 1 | Response of killer T cells to bacterial infection. a, An immune cell called a CD8" T cell 

(killer T cell) can help to control infection by a pathogenic bacterium. The T-cell receptor (TCR) recognizes 
infection when a bacterial peptide called an antigen is presented on dendritic cells of the immune system by 
an MHC class Ia protein. A conventional T-cell response is then unleashed, and the release of cytokine and 
cytotoxic molecules triggers inflammation and bacterial destruction. b, Linehan et al.’ report an analysis 

of the killer-T-cell response to Staphylococcus epidermidis using mouse models and human cells. This type 
of bacterium, called a commensal, can become a long-term, non-pathogenic resident of the body. Killer 

T cells recognize S. epidermidis when an MHC class Ib molecule presents an antigen containing an N-formy] 
modification of an amino-acid residue. The T cell expresses genes associated with tissue repair, and the 
bacterium survives this ‘unconventional’ response. However, such immune responses are probably not 
neatly divided into these categories, and conventional T-cell responses might in some cases aid tissue repair 
and unconventional T-cell responses might aid host defence against pathogens. 


These and other T-cell populations in humans 
and mice, such as invariant natural killer T cells 
and germline-encoded mycolyl-reactive T cells, 
also act in response to MHC class Ib-mediated 
bacterial-peptide recognition®’. A human 
equivalent to mouse H2-M3 has not been iden- 
tified so far. It is possible that, if such an equiva- 
lent is found, a human T-cell population might 
fulfil a similar role to that of the CD8* T cells 
observed by Linehan and colleagues. 

As the authors note, many of the day-to-day 
interactions between immune-system cells and 


METABOLISM 


microbes are probably with the commensals 
in the skin, gut and airways. Therefore, an 
immune response that maintains the status 
quo, rather than driving bacterial elimination, 
which is usually accompanied by inflamma- 
tion and tissue damage, could be beneficial. 
But should such a response really be considered 
unconventional? Although the roles of classical 
MHC molecules and responding immune cells 
were identified first, non-classical MHC and 
MHC-like proteins are evolutionarily older*” 
than MHC class Ia molecules. Linehan and 


The healthy diabetic 
cavefish conundrum 


Some Mexican cavefish have a mutation in an insulin receptor protein that 
affects blood-glucose regulation. The same mutation causes diabetes and health 
problems in humans, but the diabetic cavefish thrive. SEE LETTER P.647 


SYLVIE RETAUX 


B lind Mexican cavefish (Astyanax mexi- 


canus) live in dark caves, a challenging 
environment that has shaped their evo- 
lution. On page 647, Riddle et al. reveal the 
surprising way in which cavefish regulate their 


glucose metabolism, a finding that might have 
implications for our understanding of human 
diabetes. 

Blood-glucose regulation is essential for the 
body to function normally. In humans, this is 
achieved by a complex network of interactions 
between several organs that is mediated by 
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colleagues reveal a direct link between the role 
of MHC class Ib molecules and tissue homeo- 
stasis. Therefore, these functions might have 
evolved in parallel and represent the broadly 
‘conventional’ functions of the immune system, 
with additional immune-system specializations 
evolving over time. 

The authors’ study hints at the potential 
broad, long-term effects of microbial resi- 
dents on the immune system. If future studies 
mechanistically link individual microorgan- 
isms to specific types of immune response, it 
might be possible not only to understand the 
roles of some abundant and relatively unex- 
plored immune-cell populations, but also to 
determine how resident microbes affect health 
and disease in a way that might lead to new 
treatment approaches. m 


Paul Klenerman and Graham Ogg are in the 
NIHR Biomedical Research Centre, Oxford 
OX1 3SY, UK. BK. is also in the Translational 
Gastroenterology Unit, Nuffield Department 
of Medicine, University of Oxford. G.O. is 

also in the MRC Human Immunology Unit, 
University of Oxford. 

e-mails: paul.klenerman@ndm.ox.ac.uk; 
graham.ogg@ndm.ox.ac.uk 


1. Linehan, J. L. et al. Cell 172, 784-796 (2018). 

2. Sabaté Brescé, M. et al. Front. Microbiol. 8, 1401 
(2017). 

3. Naik, S. et al. Nature 520, 104-108 (2015). 

4. Wang, C.-R. et al. Cel! 82, 655-664 (1995). 

5. Legoux, F., Salou, M. & Lantz, O. Annu. Rev. Cell Dev. 
Biol. 33, 511-535 (2017). 

6. Van Rhijn, |. et a/. Nature Immunol. 14, 706-713 
(2013). 

7. Godfrey, D. |., Uldrich, A. P, McCluskey, J., Rossjohn, J. 
& Moody, D. B. Nature Immunol. 16, 1114-1123 
(2015). 

8. Rodgers, J. R. & Cook, R. G. Nature Rev. Immunol. 5, 
459-471 (2005). 

9. Goyos, A., Sowa, J., Ohta, Y. & Robert, J. J. /mmunol. 
186, 372-381 (2011). 


This article was published online on 26 March 2018. 


hormones and neuropeptide molecules. 
After a meal, glucose levels rise, and B-cells 
in the pancreas release the hormone insulin. 
Insulin binds to its receptor on the surface of 
liver, muscle and fat cells, stimulating them to 
take up glucose from the bloodstream, and 
thereby returning blood glucose levels to nor- 
mal. Conversely, if blood glucose levels drop, 
for example between meals, pancreatic a-cells 
release the hormone glucagon, which stimu- 
lates the liver to break down its reserves of a 
glucose polymer called glycogen. The release 
of this glucose into the bloodstream once again 
returns blood glucose levels to normal. 
Disturbance in these interplays can cause 
severe metabolic disorders, such as diabetes, 
which results in high blood-glucose levels and 
a range of deleterious effects. In type I diabetes, 
B-cells are destroyed by immune cells, and 
insulin is not produced. In type II diabetes, 
insulin is produced, but its levels are too low 
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Figure 1 | A laboratory-raised blind Mexican cavefish (Astyanax mexicanus) from the Tinaja cave 
population. Riddle et al.' report that cavefish from the Tinaja and Pach6n caves in Mexico have high 
blood-glucose levels owing to a mutation in the insulin receptor protein that is identical to one that causes 
diabetes and a range of health problems in humans”®. Surprisingly, these cavefish are healthy and have a 
normal lifespan. 


or it is not recognized by the target tissues. 
This lack of an insulin response is known as 
insulin resistance. 

Although studies of diabetes have focused 
mainly on mammalian systems, other crea- 
tures might offer a fresh perspective on the 
condition. Cavefish certainly look out of the 
ordinary, with their depigmented pink skin 
and eyeless faces (Fig. 1). Hints that these fish 
have unusual metabolic adaptations for coping 
with their lightless and seemingly food-poor 
habitat led Riddle and colleagues to investigate 
them further. Cavefish have evolved behav- 
ioural and physiological traits that differ from 
those of the river-dwelling members of their 
species. Compared with river dwellers, cave- 
fish are fatter, owing to a larger lipid reserve; 
they are better able to cope with starvation’; 
have a lower oxygen consumption and met- 
abolic rate’; and the absence of circadian 
rhythms in their metabolism enables them to 
make a 27% saving in energy usage’. 

Riddle and colleagues now reveal another 
metabolic surprise. Cavefish have higher blood 
glucose levels than the river dwellers of the spe- 
cies. The authors’ investigation revealed that 
cavefish are insulin resistant and have a mutant 
version of the insulin receptor, encoded by the 
insra gene, that does not bind and transduce 
the insulin signal. This mutation, in which the 
usual amino-acid residue proline is replaced 
by leucine, is identical to a change that causes 
diabetes and health problems in humans”. 

To investigate the effect of this mutation 
in the fish, the authors used the CRISPR- 
Cas gene-editing technique to generate 
zebrafish (Danio rerio) carrying mutated 
insra. This experiment would perhaps have 
been even more informative if performed in 


a river-dwelling A. mexicanus, but the results 
are clear: if both the maternally and paternally 
inherited copy of insra have the mutation, the 
fish are insulin resistant and overweight. The 
insulin resistance makes sense, given what is 
known about this mutation in humans, but 
the weight gain is puzzling and unexpected, 
because insulin is a growth hormone. In 
humans and rodents, lack of a functional insu- 
lin receptor is associated with growth retarda- 
tion and low body fat’. 

Does this mean that insulin signalling 
has divergent roles in fish and mammals? 
Zebrafish and A. mexicanus cavefish and river 
dwellers have two versions of the insulin recep- 
tor gene, called insra and insrb. This raises the 
question of why the receptor encoded by insrb 
doesn't compensate for the non-functional 
receptor encoded by the mutant version of 
insra. In zebrafish, the two receptors have over- 
lapping roles in growth control and glucose 
metabolism through the action of the insulin 
signalling pathway, but they have opposing 
effects on lipid metabolism, and the receptor 
encoded by insra promotes lipid synthesis*”. 
However, in cavefish and in the gene-edited 
zebrafish, the insra gene is mutated, making 
this weight gain even more puzzling. 

Perhaps one of the strangest findings is 
that cavefish have a respectable lifespan of 
more than 14 years, which is similar to that 
of the river-dwelling form. Moreover, they 
show fewer signs of age-related tissue decline. 
Elevated blood-glucose levels can damage 
tissue if the glucose bonds with proteins in a 
process termed glycation, which is linked to a 
range of health problems in people who have 
diabetes. However, cavefish do not have high 
levels of glycated proteins, so they must have 
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evolved a compensatory mechanism. One way 
to unmask this mechanism might be to study 
cavefish in which gene editing is used to cor- 
rect the insra gene mutation. The authors will 
undoubtedly be working on this, given the 
potential biomedical importance of identifying 
a glycation-inhibiting mechanism. 

Further genetic analysis by the authors 
reveals the complexity of the mechanism that 
underlies the elevated-glucose phenomenon. 
The beauty of studying A. mexicanus is that 
the cavefish and river dwellers can interbreed. 
The authors therefore studied the patterns of 
inheritance of the elevated-glucose trait, and 
found that insra is not the only gene involved. 
In the offspring of interbreeding crosses, all 
of the fish that had high blood glucose had 
the cavefish insra mutation. However, some 
of the fish with this mutation did not have 
high blood glucose, indicating that the insra 
mutation is necessary but not sufficient to alter 
blood glucose regulation. The identification of 
other genes involved might shed light on the 
weight-gain phenomenon. 

About 30 caves in Mexico are home to 
cavefish colonies’. Some of these populations 
arose from independent colonization events 
by river-dwelling ancestors. The authors 
compared three cavefish populations — two 
from the Tinaja and Pachon caves, which are 
in the same mountain range, and the third 
from the Molino cave in another mountain 
range. The authors found that all of these 
populations have high blood-glucose levels, 
but only those from Tinaja and Pachon have 
the insra mutation. Another mechanism must 
be at work in fish from Molino. The authors’ 
analyses suggest that the insulin signalling 
pathway is unaffected in these fish, which also 
have high levels of glycated proteins. Molino 
cavefish have lived for more than 13 years in 
our laboratory, suggesting that they also cope 
well with their unusual metabolism. 

The convergent evolution of this extra- 
ordinary blood-glucose dysregulation, 
through different mechanisms in fish from dif- 
ferent caves, seems counter-intuitive in terms 
of species fitness. What compensates for this 
surprising evolutionary path? The authors 
suggest that high fat reserves might help the 
fish to survive during extended periods of food 
deprivation. 

However, perhaps something else is at play. 
Caves are probably not as food-poor as they 
seem. In the Tinaja cave (home of the main 
population investigated by Riddle and col- 
leagues), the cavefish feed on mud that has a 
high carbon content''. Moreover, a substantial 
proportion of the fish in the Tinaja and Pachon 
populations have only one copy of the mutant 
version of insra. If a beneficial mutation has 
not spread throughout an entire population, 
one possible interpretation is that the muta- 
tion arose relatively recently in the popula- 
tion’s history. This would imply that the early 
cave settlers did not have the benefit of being 
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fat and starvation-resistant. Exciting times lie 
ahead for explorations of the metabolic and 
adaptive changes that occurred as different 
cavefish populations evolved, and such studies 
might uncover the underlying evolutionary 
forces responsible for this striking metabolic 
adaptation. = 
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The tornadoes of 
sudden cardiac arrest 


A clever combination of techniques has enabled, for the first time, simultaneous 
visualization of the 3D waves of electrical and mechanical activity that are 
responsible for many cases of sudden cardiac death. SEE LETTER P.667 


JOSE JALIFE 


udden cardiac arrest is a common cause 

of death in people with coronary artery 

disease’, and also kills many young people 
who have heritable heart diseases. In both 
cases, sudden death often occurs because of a 
heart-rhythm defect called ventricular fibril- 
lation?’, in which the heart rate increases dra- 
matically and cardiac-muscle contractions 
in the heart’s ventricular chambers become 
uncoordinated, rendering the heart unable to 
pump blood. Blood pressure decreases, leading 
to unconsciousness, and death follows unless 
a defibrillating electrical shock is applied. 
The mechanism underlying this anomaly 
has been debated for more than a century. 
On page 667, Christoph et al.* show that the 
use of panoramic fluorescence imaging with 
a voltage-sensitive dye, combined with ultra- 
sound imaging, could considerably advance 
our understanding of ventricular fibrillation. 
The study has potential implications for a 
broad range of researchers, from physicists to 
cardiologists. 

Normal heart rhythm is maintained by 
cyclic changes in the electrical currents that 
drive and coordinate heart-muscle contrac- 
tion. But during ventricular fibrillation, elec- 
trical impulses stop tracking their normal 
paths across the heart, and instead adopt a 
complex, vortex-like pattern similar to eddies 
in water or a tornado in the atmosphere. Fluo- 
rescence imaging of the voltage on the heart's 
surfaces has revealed’ that these electrical 


vortices involve highly periodic, spiralling 
waves of activity organized around a central 
point called a rotor. The rotor, which is the 
organizing centre of fibrillation’, spins rapidly 
and can meander across the heart’s surface, 
generating turbulent, wave-like behaviour. 

Until now, fluorescence imaging has allowed 
the visualization of rotors only on the surfaces 
of the heart's chambers, and so activity inside 
the ventricles has been inferred through com- 
puter simulations’. The 3D equivalent of 2D 
spiral waves (called scroll waves) and the fil- 
ament-shaped rotors at their centres (vortex 
filaments) could only be reconstructed using 
simultaneous 2D video images of spiral waves 
on the outer and inner surfaces of the heart’. 
This is problematic, because it has not been 
possible to prove that the electromechanical 
changes that occur in the heart proper dur- 
ing ventricular fibrillation match predictions 
made by simulations. For example, cardiac 
defibrillation by an electrical shock is the 
only known treatment for ventricular fibrilla- 
tion, but our understanding of how it works 
is currently based on numerical predictions. 

Christoph et al. have overcome this hurdle, 
thanks to a clever combination of techniques. 
The authors kept isolated pig and rabbit hearts 
alive by perfusion with a warm solution through 
the coronary arteries. They induced ventricular 
fibrillation in the hearts, and used high- 
resolution 4D ultrasound imaging to document 
changing mechanical strain over time. 

The group then combined these data with 
more-conventional 2D fluorescence imaging, 
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50 Years Ago 


Throughout this week an unusual 
meeting has been taking place 
near Paris ... experts from all over 
France — and some from other 
parts of the world — have been 
discussing what the world will be 
like in the year 2020. To keep the 
discussion within sensible limits, 
it has been set ina geographical 
context so that each day has 
been spent discussing the way 

in which different technological 
developments will affect the way 
land is used. It is clear that this 
meeting ... is intended to be 

the first of a series which could, 
no doubt, last long enough for 
the results of this meeting to 

be compared with the reality 

of 2020. 

From Nature 30 March 1968 


100 Years Ago 


Another Indian “miracle” has 

been explained by scientific 
investigation. The Pioneer Mail 

of January 11 reports a lecture 

by Sir J. C. Bose on “The Praying 
Palm Tree” of Faridpur. While 

the temple bells call the people 

to evening prayer, this tree has 
recently been seen to bow down in 
prostration, and to erect its head 

on the following morning. Large 
numbers of pilgrims have been 
attracted to the place, and offerings 
to the tree are said to have been 

the means of effecting marvellous 
cures. Sir J. C. Bose first procured 
photographs which proved the 
phenomenon to be real. The 

next step was to devise a special 
apparatus to record continuously 
the movement of the tree by day and 
night. The records showed that it fell 
with the rise of temperature and rose 
with the fall. The records obtained 
in the case of other trees brought 
out the fact that all the trees are 
moving, each movement being due 
to changes in their environment. 
From Nature 28 March 1918 
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Figure 1 | Electromechanical vortices in a pig heart. In a deadly heart-rhythm defect called 
ventricular fibrillation, the electrical impulses that normally coordinate heartbeats become erratic. 
Christoph et al.’ visualized these erratic electrical impulses, and the corresponding changes in 
mechanical strain, in 3D in isolated rabbit (not shown) and pig hearts. In line with previous predictions 
based on 2D data, the authors observed coexisting electrical and mechanical vortex filaments — areas 
of constant activity around which 3D ‘scroll waves’ of electromechanical activity swirl in rapid, highly 
periodic cycles (differences in activity indicated in different colours). The scroll wave shown here is a 


simplified schematic based on Figure 2 of the paper’. 


in which they took simultaneous measurements 
of three factors during each heartbeat: changes 
in electrical potential across cell membranes; 
changes in intracellular calcium-ion (Ca?*) 
levels; and muscle contraction. The first two 
factors provide information about the molecu- 
lar coupling between electrical and mechanical 
activity. Because this type of imaging can easily 
be distorted by spurious mechanical move- 
ments, Christoph et al. also took the crucial 
step of using computational motion-tracking 
techniques to correct any errors made during 
imaging. Together, these data enabled them 
to generate 3D maps of how mechanical and 
electrical waves propagate through the heart 
during ventricular fibrillation (Fig. 1). 

Next, the authors generated computer simu- 
lations of electromechanical wave dynamics in 
virtual hearts that had realistic heart geometry 
based on computerized-tomography scans 
and information about muscle-fibre orienta- 
tion, electrical impulses, electromechanical 
coupling and muscle contraction. From these 
simulations, they resolved more details of the 
dynamics of 3D scroll waves inside the con- 
tracting muscle. They found that coexisting 
electrical and mechanical vortex filaments 
occurred from the innermost to the out- 
ermost linings of the heart, and displayed 
complex patterns of intrinsically coupled 
electrical and mechanical waves that swirled 
at a high frequency through both ventricles. 

Christoph and colleagues’ findings 
are noteworthy because they confirm long- 
held predictions about the electromechanical 
behaviour of the heart during ventricular fibril- 
lation. This first experimental observation of 
electrical and mechanical vortex filaments and 


their coupled dynamics also provides proof 
of a concept that began to emerge more than 
60 years ago” * — that 3D, vortex-like phenom- 
ena that occur in certain chemical reactions 
in vitro also cause fibrillation in vivo. 

When this concept first began to emerge, it 
opened up a completely new field of cardiac 
electrophysiology’*""*, which is rapidly 
expanding today. Christoph and colleagues’ 
study has the potential to contribute to this 
rapid advance. The tools that the authors 
used to identify mechanical vortex filaments 
are readily available in most cardiology units. 
They could theoretically be used to advance 
our understanding of cardiac defibrillation 
and possibly aid the development of pharma- 
cological therapies for ventricular fibrillation. 
Before long, the authors’ method might be 
used to monitor fibrillating human hearts 
from the body’s surface, enabling investiga- 
tion of the mechanisms underlying ventricular 
arrhythmias in patients who have structurally 
normal hearts — such as young people who 
have heritable heart diseases. However, more 
animal studies and further technical develop- 
ments are likely to be needed to translate this 
strategy to people whose hearts have been 
scarred bya heart attack’. 

More work will also be required before 
the imaging technology can be applied to 
fibrillation in the heart’s atrial chambers. 
Atrial fibrillation, like ventricular fibrillation, 
is maintained by high-frequency rotors, and 
it is the most common cardiac arrhythmia in 
humans. Unfortunately, currently available 
ultrasound probes lack sufficient temporal 
and spatial resolution to study atrial arrhyth- 
mias. Christoph and colleagues’ study should 
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stimulate further advancement of technologies 
that open the path towards the study and 
diagnosis of atrial fibrillation in patients. 

Another caveat to clinical translation, 
acknowledged by Christoph and co-workers, 
is that the generation of mechanical waves 
requires transient release of Ca’* from intra- 
cellular stores into the cell cytoplasm each 
time the cell receives electrical excitation. 
The amplitude of the Ca” transient, and of 
the mechanical contraction generated by it, 
depends on the rate of cell excitation. Ca”* 
transients are readily observed at relatively 
low frequencies of excitation. But sometimes 
in ventricular fibrillation, the frequency is 
too high for Ca" release. In these cases, Ca** 
transients might disappear, or at least dimin- 
ish substantially’. Consequently, mechanical 
waves might also diminish or disappear. This 
would result in dissociation between the elec- 
trical and mechanical waves, which, in turn, 
would impair the visualization of electro- 
mechanical wave dynamics inside the heart 
wall, because (except in very invasive experi- 
mental conditions) electrical waves can be 
visualized only from the surface. 

Despite these limitations, imaging of 
electromechanical vortex filaments during 
cardiac fibrillation is an exciting technologi- 
cal accomplishment. Christoph et al. have 
observed, for the first time, complex inter- 
actions between electrical and mechanical 
rotors in 3D, and characterized their dynam- 
ics in detail. This important contribution is 
likely to lead to both scientific and therapeutic 
advances. ™ 
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A coherent spin- photon interface 


in silicon 


X. Mi, M. Benito’, S. Putz!, D. M. Zajac!, J. M. Taylor’, Guido Burkard? & J. R. Petta! 


Electron spins in silicon quantum dots are attractive systems for quantum computing owing to their long coherence times 
and the promise of rapid scaling of the number of dots in a system using semiconductor fabrication techniques. Although 
nearest-neighbour exchange coupling of two spins has been demonstrated, the interaction of spins via microwave- 
frequency photons could enable long-distance spin-spin coupling and connections between arbitrary pairs of qubits 
(‘all-to-all’ connectivity) in a spin-based quantum processor. Realizing coherent spin-photon coupling is challenging 
because of the small magnetic-dipole moment of a single spin, which limits magnetic-dipole coupling rates to less than 
1 kilohertz. Here we demonstrate strong coupling between a single spin in silicon and a single microwave-frequency 
photon, with spin-photon coupling rates of more than 10 megahertz. The mechanism that enables the coherent spin- 
photon interactions is based on spin-charge hybridization in the presence of a magnetic-field gradient. In addition to 
spin-photon coupling, we demonstrate coherent control and dispersive readout of a single spin. These results open up a 
direct path to entangling single spins using microwave-frequency photons. 


Solid-state electron spins and nuclear spins are quantum mechanical 
systems that can be almost completely isolated from environmental 
noise. As a result, they have coherence times as long as hours and so 
are one of the most promising types of quantum bit (qubit) for con- 
structing a quantum processor’~*. On the other hand, this degree of 
isolation poses difficulties for the spin-spin interactions that are needed 
to implement two-qubit gates. So far, most approaches have focused 
on achieving spin-spin coupling through the exchange interaction 
or the much weaker dipole-dipole interaction’ ®. Among existing 
classes of spin qubits, electron spins in gate-defined silicon quan- 
tum dots have the advantages of scalability due to mature fabrication 
technologies and low dephasing rates due to isotopic purification’. 
Currently, silicon quantum dots are capable of supporting fault-toler- 
ant control fidelities for single-qubit gates and high-fidelity two-qubit 
gates based on exchange*!”. Coupling of spins over long distances 
has been pursued through the physical displacement of electrons'*-'® 
and through ‘super-exchange’ via an intermediate quantum dot'”. The 
recent demonstration of strong coupling between the charge state ofa 
quantum-dot electron and a single photon has raised the prospect of 
strong spin—photon coupling, which could enable photon-mediated 
long-distance spin entanglement!*°. Spin-photon coupling may be 
achieved by coherently hybridizing spin qubits with photons trapped 
inside microwave cavities, in a manner similar to cavity quantum 
electrodynamics with atomic systems and circuit quantum electro- 
dynamics with solid-state qubits'?*°. Such an approach, however, is 
extremely challenging: the small magnetic moment of a single spin 
leads to magnetic-dipole coupling rates of 10-150 Hz, which are far too 
slow compared with electron-spin dephasing rates to enable a coherent 
spin—-photon interface”. 

Here, we resolve this outstanding challenge by using spin-charge 
hybridization to couple the electric field of a single photon to a single 
spin in silicon*?*!-**, We measure spin—photon coupling rates g,/(27) 
of up to 11 MHz, nearly five orders of magnitude higher than typical 
magnetic-dipole coupling rates. These values of g,/(27) exceed both 
the photon decay rate «/(27) and the spin decoherence rate y,/(27), 


firmly anchoring our spin-photon system in the strong-coupling 
regime26930, 

Our coupling scheme consists of two stages of quantum-state 
hybridization. First, a single electron is trapped within a gate-defined 
silicon double quantum dot (DQD) that has a large electric-dipole 
moment. A single photon confined within a microwave cavity 
hybridizes with the electron charge state through the electric-dipole 
interaction****. Second, a micrometre-scale magnet (micromagnet) 
placed over the DQD hybridizes electron charge and spin by producing 
an inhomogeneous magnetic field?'~*4. The combination of the 
electric-dipole interaction and spin-charge hybridization gives rise 
to a large effective spin—photon coupling rate. At the same time, the 
relatively low level of charge noise in the device ensures that the effec- 
tive spin decoherence rate 7), remains below the coherent coupling rate 
&s—a criterion that has hampered previous efforts to achieve strong 
spin—-photon coupling*”. 

As well as demonstrating a coherent spin—photon interface, we also 
show that our device architecture is capable of single-spin control 
and readout. Single-spin rotations are electrically driven?** and the 
resulting spin state is detected through a dispersive phase shift in the 


cavity transmission, which reveals Rabi oscillations*®. 


Spin-photon interface 

The device that enables strong spin-photon coupling is shown in 
Fig. la and contains two gate-defined DQDs fabricated using an over- 
lapping aluminium gate stack (Fig. 1b). The gates are electrically biased 
to create a double-well potential that confines a single electron in the 
underlying natural-silicon quantum well (Fig. 1c). A plunger gate (P2) 
on each DQD is connected to the centre pin of a half-wavelength nio- 
bium superconducting cavity with a centre frequency of f= 5.846 GHz 
and quality factor of Q.= 4,700 (K/(27) =f-/Q.= 1.3 MHz), which 
hybridizes the electron charge state with a single cavity photon through 
the electric-dipole interaction'®*°***°, Because the spin-photon 
coupling rate g, is directly proportional to the charge-photon coupling 
rate g (refs 25, 31-34, 39-41), we have modified the cavity dimensions 
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Figure 1 | Spin-photon interface. a, Optical image of the superconducting 
microwave cavity. The inset shows an optical image of the centre pin 

(0.6 zm) and vacuum gap (201m) of the cavity. b, False-colour scanning 
electron micrograph (SEM) of a DQD. Gate electrodes are labelled as G1, 
G2, S, D, B1, P1, B2, P2 and B3, where G1 and G2 are screening gates, 

Sand D are used for accumulating electrons in the source and drain 
reservoirs, and B1 and B3 control the tunnel barrier of each dot to its 
adjacent reservoir. The locations of the cobalt micromagnets are indicated by 
the orange dashed lines. c, Schematic cross-sectional view of the DQD 
device. The blue line indicates the electrostatic confinement potential which 
delocalizes a single electron between the two dots (indicated as half-filled 
circles). The quantization axis of the electron spin (red arrow) changes 
between the dots. d, Cavity transmission amplitude A/Apo at f=f., where f. is 
the centre frequency of the cavity, near the (1, 0) < (0, 1) inter-dot transition 
for DQD1, plotted as a function of the voltages on gates P1 and P2, Vp; and 
Vp2, with BS = 0 and Vg: =710 mV. The dashed arrow denotes the DQD 
detuning parameter ¢, which is equal to the difference in the chemical 
potentials of the two dots and points along the vertical direction because in 
this work Vp, is changed to vary €. Vp2 denotes the voltage on gate B2, which 
controls the inter-dot tunnel coupling t.. e, A/Ao as a function of ¢ with 

Vp2 = 710 mV (red line) and a fit to cavity input-output theory (black 
dashed line), with g./(2m) =40 MHz. f, 2t,/h as a function of Vg2 for DQD1, 
obtained by measuring A(¢)/Ap at different values of Vp. 


(Fig. 1a, inset) to achieve a high characteristic impedance Z, and there- 
fore a high g. (g, x ./Z,; ref. 20). To hybridize the charge state of the 
trapped electron with its spin state, a cobalt micromagnet is fabricated 
near the DQD, which generates an inhomogeneous magnetic field. For 
our device geometry, the magnetic field due to the cobalt micromagnet 
has a component along the z axis B™ that is approximately constant for 
the DQD and a component along the x axis that takes on an average 
value of Bey (Bea) for the left (right) dot (Fig. 1c, Extended Data 
Fig. 1). The relatively large field difference BY, — By’, =2B%' leads to 
spin-charge hybridization, which, when combined with charge-photon 
coupling, gives rise to spin-photon coupling****. 

We first characterize the strength of the charge—photon interac- 
tion, because this sets the scale of the spin-photon interaction rate. 
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For simplicity, only one DQD is active at a time for all of the measure- 
ments presented here. The cavity is driven by a coherent microwave 
tone at frequency f= f, and power P ~ —133 dBm (corresponding 
to approximately 0.6 photons in the cavity, determined on the basis 
of AC Stark shift measurements of the spin-qubit frequency in the 
dispersive regime; see Extended Data Fig. 2)”. The normalized cavity 
transmission amplitude A/Ao is displayed in Fig. 1d as a function of 
the voltages Vp; and Vp on gates P1 and P2 of the first DQD (DQD1), 
which reveals the location of the (1,0) < (0, 1) inter-dotcharge transi- 
tion (see Extended Data Fig. 3 for overall stability diagrams) !*?%5°, 
Here (n, m) denotes a charge state, with the number of electrons in the 
left (P1) and right (P2) dot being and m, respectively. The charge- 
photon coupling rate is estimated quantitatively by measuring A/Ao as a 
function of the DQD level detuning ¢ (Fig. le). By fitting the data with 
the cavity input-output theory model using «/(27) = 1.3 MHz, we find 
gc/ (27) =40 MHz and 2t,/h = 4.9 GHz, where tf; is the inter-dot tunnel 
coupling and h is the Planck constant'?*°?”, A charge decoherence rate 
of ¥./(27) = 35 MHz is also estimated from the fit and confirmed inde- 
pendently using microwave spectroscopy with 2t./h = 5.4 GHz (refs 19, 
20, 42). Fine control of the DQD tunnel coupling, which is critical for 
achieving spin-charge hybridization*’, is shown in Fig. 1f, in which 
2t,/h is plotted as a function of the voltage Vg, on the inter-dot barrier 
gate B2. A similar characterization of the second DQD (DQD2) yields 
gol (2%) =37 MHzand 4,/(27) =45 MHzat the (1, 0) < (0, 1) inter-dot 
charge transition. Owing to the higher impedance of the resonator, the 
values of g- measured here are much larger than in previous silicon DQD 
devices!?“3, which is helpful for achieving strong spin-photon coupling. 
In general, there are device-to-device variations in 7, (refs 19, 43). 
It is unlikely the slightly higher charge decoherence rate is a result of 
our cavity design, because the Purcell decay rate’? is estimated to be 
TJ (2%) 0.02 MHz < 7,/(27). Excited valley states are not visible in 
the cavity response of either DQD, suggesting that they have negligible 
population™. We therefore exclude valleys from the analysis below. 


Strong single spin-photon coupling 

We now demonstrate strong coupling between a single electron spin 
and a single photon, as evidenced by the observation of vacuum Rabi 
splitting. Vacuum Rabi splitting occurs when the transition frequency 
of a two-level atom f, is brought into resonance with a cavity photon of 
frequency f- (refs 21, 23). Light-matter hybridization leads to two 
vacuum-Rabi-split peaks in the cavity transmission. For our single-spin 
qubit, the transition frequency between two Zeeman-split spin states 
is f. © Ez/h, where Ez =g/tpBior is the Zeeman energy and the approxi- 
mate sign is due to spin-charge hybridization, which shifts the qubit 
frequency slightly. Here g is the g-factor of the electron, jig is the Bohr 


magneton and By =  [((BM, + BM,)/2) + (BM + BS)? is the total 


magnetic field. To bring f, into resonance with f,, we vary the external 
magnetic field BS“ along the z axis while measuring the cavity trans- 
mission spectrum A/Ag as a function of the drive frequency f (Fig. 2a). 
Vacuum Rabi splittings are clearly observed at BS = — 91.2 mT and 
BS“ = 92.2 mT, indicating that E7/h =f at these field values and that 
the single spin is coherently hybridized with a single cavity photon. 
These measurements are performed on DQD1, with 2t./h =7.4GHz 
and ¢=0. The dependence of g, on € and f, is investigated below*!. 
Assuming g=2 for silicon, we estimate that an intrinsic field of about 
120 mT is added by the micromagnet, comparable to values found in a 
previous experiment using a similar cobalt micromagnet design’. 

To further verify the strong spin-photon coupling, we plot the cavity 
transmission spectrum at BS'=92.2mT (Fig. 2b). The two 
normal-mode peaks are separated by the vacuum Rabi frequency 
2g,/(2%) = 11.0 MHz, giving an effective spin-photon coupling rate of 
&5/(2%) =5.5 MHz. The photon decay rate at finite magnetic field is 
extracted from the line width of A/Ay at BS" = 90.3 mT, at which E7/h 
is largely detuned from f,, yielding «/(27) = 1.8 MHz. A spin decoher- 
ence rate of y,/(27) = 2.4 MHz, with contributions from both charge 
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Figure 2 | Strong single spin-photon coupling. a, A/Ao as a function of 
the cavity drive frequency fand an externally applied magnetic field BS“ 
for DQD1. Insets show data from DQD2 at the same values of t, and ¢ and 
plotted over the same range of f. BS“ ranges from —94 mT to —91.1 mT 
(91.1 mT to 94 mT) for the left (right) inset. b, A/Ao as a function of f for 
DQD1 at BS = 90.3 mT (red) and BS“ = 92.2 mT (blue) c, A/Ao asa 
function of f for DQD2 at BS = 91.1 mT (red) and BS = 92.6 mT (blue). 
In b and ¢, the frequency difference between the two transmission peaks, 
indicated by the black arrows, is 11.0 MHz (b) and 10.6 MHz (c). The 
spin—-photon coupling rate g,/(2%) corresponds to half the frequency 
separation and so is 5.5 MHz for DQD1 and 5.3 MHz for DQD2. 


decoherence and magnetic noise from the 7°Si nuclei, is extracted from 
microwave spectroscopy in the dispersive regime with 2t,/h =7.4GHz 
and ¢=0 (Extended Data Fig. 4), confirming that the strong-coupling 
regime g, > Y;, & has been reached. The spin-photon coupling rate 
obtained here is more than four orders of magnitude larger than rates 
currently achievable using direct magnetic-dipole coupling to 
lumped-element superconducting resonators*”*». 

The local magnetic field that is generated using cobalt micromagnets 
is very reproducible, as evidenced by examining the other DQD in the 
cavity. Measurements on DQD2 show vacuum Rabi splittings at 
BS = +92.6 mT (Fig. 2a, insets). The spin—photon coupling rate and 
spin decoherence rate are determined to be g,/(2) =5.3 MHz and 
ys/(27) = 2.4 MHz, respectively (Fig. 2c). These results are highly 
consistent with DQD1, and so we henceforth focus on DQD1. 


Electrical control of spin-photon coupling 

For quantum information applications it is desirable to turn qubit- 
cavity coupling rapidly on for quantum-state transfer and rapidly off 
for qubit-state preparation. Rapid control of the coupling rate is often 
accomplished by quickly modifying the qubit-cavity detuning f, — f.. 
Practically, such tuning can be achieved by varying the qubit transi- 
tion frequency f, with voltage or flux pulses**’ or by using a tunable 
cavity”. These approaches are not directly applicable for control of 
the spin-photon coupling rate because f, depends primarily on mag- 
netic fields that are difficult to vary on nanosecond timescales. In this 
section, we show that control of the spin-photon coupling rate can be 
achieved electrically by tuning ¢ and tf, (refs 32, 40). 

We first investigate the « dependence of g,. In Fig. 3a we show meas- 
urements of A/Ao as a function of BS“ and f for ¢=0, ¢ = 20,1eV and 
€=40peV. At e=20 peV (about 4.8 GHz), vacuum Rabi splitting 
is observed at BS = 92.1 mT with a spin-photon coupling rate of 
g;/(2%) = 1.0 MHz that is substantially lower than the value of 
gs/(27) = 5.5 MHz obtained at ¢=0. At ¢=40 eV (about 9.7 GHz), 
only a small dispersive shift is observed in the cavity transmission 
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Figure 3 | Electrical control of spin-photon coupling. a, A/Ao as a 
function of fand BS“ at e =0 (left), ¢ = 20 jreV (about 4.8 GHz; middle) 
and ce = 40 peV (about 9.7 GHz; right), with 2t./h = 7.4 GHz. Insets show 
A/Agasa function of fat the values of BS“ indicated by the white dashed 
lines in the main panels. b, Spin-photon coupling rate g,/(2) (top) and 
spin decoherence rate y,/(27) (bottom) as functions of 2f,/h, with «= 0 
(data). The dashed lines show theoretical predictions. A potential 
uncertainty of 0.01-0.1 MHz exists for each value of g,/(2) and 7,/(27) 
owing to uncertainties in the locations of the transmission peaks used to 
determine g,/(2) (Extended Data Fig. 5) and the widths of the Lorentzian 
fits used to determine y,/(21) (Extended Data Fig. 4). c, DQD energy levels 
as a function of , calculated with B® + BM = 209 mT, BM = 15 mT and 
2t./h =7.4 GHz. Here B™ denotes the magnetic field produced by the 
cobalt micomagnet parallel to BS“, and B;" is related to the strength of the 
inhomogeneous magnetic field perpendicular to BS“. The symbols f (|), 

L (R) and — (+) denote the quantum states of the electron that correspond 
to up (down) spin states, left-dot (right-dot) orbital states and molecular 
bonding (anti-bonding) states, respectively. The schematics at the top 
illustrate the distribution of the wavefunction of the electron in different 
regimes of €. For ¢ >> t, and —¢€ > t,, the electron is localized within one 
dot and tunnelling between the dots is largely forbidden, resulting in a 
small g, due to a small effective oscillating magnetic field. For |e| < t., the 
electron may tunnel between the two dots and experience a large 
oscillating magnetic field due to the spatial field gradient, resulting in a 
large g,. 


spectrum at BS = 91.8 mT, suggesting a further decrease in g,. These 
observations are qualitatively understood by considering that at ¢=0 
the electron is delocalized across the DQD and forms molecular 
bonding (|—)) or anti-bonding (|+-)) charge states (Fig. 3c). In this 
regime, the cavity electric field leads to a displacement of the electron 
wavefunction of the order of 1nm (Methods)**. Consequently, the 
electron spin experiences a large oscillating magnetic field, resulting in 
a substantial spin—-photon coupling rate. By contrast, with |<| >t, the 
electron is localized within one dot and it is natural to work with a basis 
of localized electronic wavefunctions |L) and |R), where L and R 
correspond to the electron being in the left and right dot, respectively 
(Fig. 3c). In this effectively single-dot regime, the displacement of the 
electron wavefunction by the cavity electric field is estimated to be 
about 3 pm for a single-dot orbital energy of Eo,p =2.5 meV (ref. 48), 
greatly suppressing the spin—-photon coupling mechanism*’. The large 
difference in the effective displacement lengths between the single-dot 
and double-dot regimes also implies an improvement in the spin- 
photon coupling rate at ¢ =0 of approximately two orders of magnitude 
compared to |e| >>¢,. Alternatively, the reduction of g, may be 
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Figure 4 | Quantum control and dispersive readout of a single spin. 

a, Cavity phase response Ad at f=f, when gate P1 is driven continuously 
at a variable frequency f, and power P, = —106 dBm, with 2t,/h =9.5 GHz 
and ¢=0. A background phase response, obtained by measuring A¢(BS") 
in the absence of a microwave drive on P1, is subtracted from each column 
of the data to correct for slow drifts in the microwave phase. b, Electron 
spin resonance (ESR) line as measured in A@(f,) at 2t,/h = 11.1 GHz, ¢=0, 
B:“' = 92.18 mT and P, = —123 dBm (data). The dashed line shows a fit to 
a Lorentzian with a full-width at half-maximum of 7,/7 = 0.81 + 0.04 MHz 
(indicated by the arrows). c, Schematic showing the experimental sequence 
for coherent spin control and measurement. Spin control is performed 
using a high-power microwave burst when the electron is largely localized 
within one dot ({e| >> f,) and spin-photon coupling is turned off. Spin- 
state readout is achieved using the dispersive response of a cavity photon 
at e =0 and when spin-photon coupling is turned on. d, Ag as a function 
of Tp, with 2t./h = 11.1 GHz and ec’ =70 jteV, showing single-spin Rabi 
oscillations. The excited-state population of the spin qubit P; is indicated 
on the right y axis (see Methods). 


interpreted as a result of suppressed hybridization between the |—, T) 
and |+, |) states due to their growing energy difference at larger |<|, as 
evident from Fig. 3c (see discussion below). Here 7 (|) denotes an elec- 
tron spin that is aligned (anti-aligned) with BS“. These measurements 
highlight the important role of charge hybridization in the DQD. 
Additional electric control of g, is enabled by voltage tuning f. (Fig. 1f). 
In Fig. 3b we show g,/(27) and 7,/(27) as functions of 2t./h at e=0, as 
extracted from vacuum Rabi splitting measurements and microwave 
spectroscopy of the electron spin resonance (ESR) transition line width 
(Figs 2b, 4b, Extended Data Figs 4, 5). Both rates increase rapidly as 2t,/h 
approaches the Larmor precession frequency Ez/h 5.8GHz, and a 
spin-photon coupling rate as high as g,/(2) = 11.0 MHz is found at 
2t./h = 5.2 GHz. These trends are consistent with the DQD energy-level 
spectrum shown in Fig. 3c*3*4!, With 2t,/h >> Ez/hand <=0, the two 
lowest energy levels are |—, |) and |—, {) and the electric-dipole 
coupling to the cavity field is small. As 2¢, is reduced and made compa- 
rable to Ez, the ground state remains |—, |) but the excited state becomes 
an admixture of |—, |) and |+, |) owing to the magnetic-field gradient 
BM, — By, = 2B" and the small energy difference between the states. 
The quantum transition that is close to resonance with Ez is now par- 
tially composed of a change in charge state from — to +, which responds 
strongly to the cavity electric field and gives rise to larger values of g.. 
For 2t./h < Ez/h, a decrease in f, increases the energy difference between 
|—, }) and |+, |), which reduces their hybridization and results in a 
smaller g,. We note that hybridization with charge states increases the 
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susceptibility of the spin to charge noise and relaxation, resulting in an 
effective spin decoherence rate ¥; that is also strongly dependent on ft 
(Fig. 3b)*3344!, Theoretical predictions of g, and +, as functions of 2¢,/h, 
based on measured values of g. and ¥. (Fig. le) are in good agreement 
with the data (Fig. 3b)". The discrepancy in the fit of +, is discussed in 
Methods. The electric control of spin-photon coupling demonstrated 
here allows the spin qubit to switch quickly between regimes with strong 
coupling to the cavity and idle regimes in which the spin-photon 
coupling rate and susceptibility to charge decoherence are small. 


Dispersive readout of a single spin 
The preceding measurements demonstrate the ability to couple a single 
electron spin to a single photon coherently, potentially enabling long- 
range spin-spin couplings***”. For the device to serve as a building 
block for a quantum processor, it is also necessary to prepare, control 
and read out the spin state of the trapped electron deterministically. We 
first induce spin transitions by driving gate P1 with a continuous micro- 
wave tone of frequency f; and power P;= —106 dBm. When f, = Ez/h, 
the excited-state population of the spin qubit P; increases and the 
ground state-population P| decreases. In the dispersive regime, in 
which the qubit-cavity detuning A/(217) = Ez/h — f, satisfies 
|A/(27)| >g,/(2n), the ae | transmission experiences a phase 
response Ag x tan (2g? /(KA)] for a fully saturated (P; = 0.5) 
qubit!*. It is therefore possible to measure the spin state of a single 
electron by probing the cavity transmission. As a demonstration, we 
spectroscopically probe the ESR transition by measuring A@ as a func- 
tion of f,and BS (Fig. 4a). These data are acquired with 2t./h=9.5 GHz 
and e =0. The ESR transition is clearly visible as a narrow feature with 
A@ =~ 0 that shifts to higher f, with increasing BS“. A¢@ also changes 
sign as BS“ increases, consistent with the sign change of the qubit-cavity 
detuning A when the Larmor precession frequency Ez/h exceeds f.. 
The nonlinear response in the small region around BS“ = 92 mT is due 
to the breakdown of the dispersive condition | A/(21)| > g,/(27). 
Finally, we demonstrate coherent single-spin control and dispersive 
spin-state readout. For these measurements, we choose ¢ = 0 and 
2t,/h= 11.1 GHz to minimize the spin decoherence rate ¥, (Fig. 3b). 
Here the spin-photon coupling rate g,/(27) = 1.4 MHz (Fig. 3b). The 
external field is fixed at B* = 92.18 mT, which ensures that the system 
is in the dispersive regime with A/(2x) = 14 MHz > g,/(21). A meas- 
urement of Ad(f,) in the low-power limit (Fig. 4b) yields a Lorentzian 
line shape with a full-width at half-maximum of 0.81 MHz, which 
corresponds to a low spin decoherence rate of y./(2) = 0.41 MHz 
(refs 19, 42). Qubit control and measurement are achieved using the 
pulse sequence illustrated in Fig. 4c. Starting with a spin-down state 
||) at e=0, the DQD is pulsed to a large detuning ¢’ = 70 j1eV (about 
17 GHz), which decouples the spin from the cavity. A microwave burst 
with frequency f; = 5.874 GHz, power P,= —76 dBm and duration T, 
is subsequently applied to P1 to drive a spin rotation?°*, The DQD 
is then pulsed adiabatically back to =0 for a fixed measurement time 
Tm for dispersive readout. To reinitialize the qubit, we choose 
Tu = 20 pts > Ti(e = 0), where T)(¢ =0) = 3.2 1s is the spin relaxation 
time measured at ¢ =0 (Extended Data Fig. 6). Figure 4d displays the 
time-averaged Ad as a function of Tp, obtained with an integration time 
of 100 ms for each data point. We observe coherent single-spin Rabi 
oscillations with a Rabi frequency of fg =6 MHz. In contrast to readout 
approaches that rely on spin-dependent tunneling”**”, our dispersive 
cavity-based readout corresponds in principle to quantum non- 
demolition readout™. The readout scheme is also distinct from previous 
work that used a cavity-coupled InAs DQD, which detects the spin state 
through Pauli blockade rather than spin-photon coupling*. In addition 
to enabling single spin-photon coupling, our device is capable of 
preparing, controlling and dispersively reading out single spins. 


Conclusion 
We have realized a coherent spin—-photon interface at which a single 
spin in a silicon DQD is strongly coupled to a microwave-frequency 
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photon through the combined effects of the electric-dipole interac- 
tion and spin-charge hybridization (see Methods for a discussion of 
the prospects of applying the spin—photon interface to realize cavity- 
mediated spin-spin coupling). Spin-photon coupling rates of up 
to 11 MHz are measured in the device, exceeding magnetic-dipole 
coupling rates by nearly five orders of magnitude. The spin decoher- 
ence rate is strongly dependent on the inter-dot tunnel coupling t, and 
ranges from 0.4 MHz to 6 MHz, possibly limited by a combination of 
charge noise, charge relaxation and remnant nuclear field fluctuations. 
All-electric control of spin-photon coupling and coherent manipula- 
tion of the spin state are demonstrated, along with dispersive readout of 
the single spin, which lays the foundation for quantum non-demolition 
readout of spin qubits. These results could enable the construction of 
an ultra-coherent spin quantum computer with photonic interconnects 
and readout channels, with the capacity for surface codes, ‘all-to-all’ 
connectivity and easy integration with other solid-state quantum 
systems such as superconducting qubits**46470->?, 

We note that two related preprints appeared after the submission 
of this Article: ref. 53 presents some of the results discussed here, and 
ref. 54 explores a different approach to spin-photon coupling 
and demonstrates coherent coupling of a triple quantum dot to 
microwave-frequency photons. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Device fabrication and measurement. The Si/SiGe heterostructure consists of a 
4-nm-thick Si cap, a 50-nm-thick Sig 7Geo,3 spacer layer, a 8-nm-thick natural-Si 
quantum well and a 225-nm-thick Sip 7Geo3 layer on top of a linearly graded 
Si, .Ge, relaxed buffer substrate. Design and fabrication details for the super- 
conducting cavity and DQDs are described elsewhere’’. The approximately 
200-nm-thick Co micromagnet is defined using electron beam lithography and lift 
off. In contrast to earlier devices, the gate filter for P1 was changed to an L;-C-L 
filter, with L; =4 nH, C=1 pF and Lj)= 12 nH (ref. 43). This three-segment filter 
allows microwave signals below 2.5 GHz to pass with less than 3 dB of attenuation. 

All data are acquired in a dilution refrigerator with a base temperature of 10 mK 
and electron temperature of T. =60mK. The measurements of the transmission 
amplitude and phase response of the cavity (Figs 1, 4) are performed using a homo- 
dyne detection scheme”’. The measurements of the transmission spectra of the 
cavity (Figs 2, 3) are performed using a network analyser. The microwave drive 
applied to P1 (Fig. 4) is provided by a vector microwave source and the detuning 
pulses are generated by an arbitrary waveform generator, which also controls the 
timing of the microwave burst (Fig. 4d). 

To maximize the magnetization of the Co micromagnet and minimize 
hysteresis, data at positive (negative) external applied magnetic fields (Fig. 2a) are 
collected after Bot is first ramped to a large value of +300 mT (—300 mT). A small 
degree of hysteresis still remains for the micromagnet of DQD1, as can be seen by 
the different magnitudes of BS“ at which positive- and negative-field vacuum Rabi 
splittings are observed (Fig. 2a). In Fig. 4a, the slope of the ESR transition is 
d(Ez/h)/dBS* = 44 MHz mT™}, which is higher than the value (28 MHz mT™!) 
expected for a fully saturated micromagnet. The slope of the transition suggests 
that the micromagnet is not fully magnetized and has a magnetic susceptibility of 
dBM/dB® ~ 0.6 around BS = 92 mT. 

Estimate of displacement length. The displacement length of the electron wave- 
function by the cavity electric field may be estimated by considering the spin— 
photon coupling strength. For g,/(27) = 10 MHz, the effective AC magnetic field 
BES® that drives ESR is Brs® = [g,/(2%)][h/(gip)] © 0.4 mT. The field gradient 
for our DQD is 2BM /1=0.3 mT nm}, where /= 100nm is the inter-dot distance. 
Therefore, the effective displacement of the electron wavefunction is estimated to 
be about 1 nm in the DQD regime. For a single dot, the spin-photon coupling 
strength is expected to be g /(2m) & (gitpB™ /Eor)(g./27) ~ 30 kHz(refs 31, 33) 
for an orbital energy of Eo.) =2.5 meV (ref. 48). The equivalent AC magnetic field 
that is induced by the cavity is therefore BES® ~ 1 wT, corresponding to a displace- 
ment length of only about 3 pm. 

Conversion of cavity phase response to spin population. For the dispersive read- 
out of the Rabi oscillations (Fig. 4d), the theoretically expected cavity phase 
response is ¢, = tan” ![2g?/(«A)] = 9.6° when the spin qubit is in the excited 
state, and @ | = —tan™'[2g°/(KA)] =—9.6° when the spin qubit is in the ground 


state*?°>, Because our measurement is averaged over Tm > Ti, spin relaxation dur- 
ing readout will reduce the phase contrast observed in the experiment. To enable 
a conversion between the phase response of the cavity A¢ and the excited-state 
population of the spin qubit P;, we measure the spin relaxation time T; by fixing 
the microwave burst time at 7, = 80 ns, which corresponds to a 7 pulse on the spin 
qubit. The phase response of the cavity Ad is then measured as a function of Ty 
for Ty > 5 \1s > T; (Extended Data Fig. 6). The result is fitted to a function of the 
form Ag= ¢o + ¢1(Ti/Tw) [1 — exp(—Tw/T})] to extract T; = 3.2 1s, where @p and 
¢; are additional fitting parameters*®. We have ignored the effects of the cavity 
ringdown time 1/« ~90ns and the 1-pulse time of 80 ns in the fit, because both of 
these times are much shorter than the measurement time Ty. The phase contrast 
that results from the fit, 6; + 17.7°, is close to the maximum contrast expected at 
this spin-photon detuning, ¢; — ¢, = 19.2°. On the basis of this value of T, we 
convert the measured phase response into the excited-state population via 
P) = (1+ Ad/6;,)/2, where $),.= O)(Ti/Tw) [1 — exp(—Ty/T))] = 1.5° is the 
reduced phase response due to spin relaxation during the readout time Ty = 20 1s. 
The converted spin population P; shown in Fig. 4d has a visibility of about 70%, 
which could be improved by performing single-shot measurements”. 
Input-output theory for cavity transmission. Here we briefly summarize the 
theoretical methods used to calculate the cavity transmission A/Ay shown in 
Fig. le and Extended Data Fig. 7; see ref. 41 for a detailed description of the theory. 
We start from the Hamiltonian that describes the DQD 


1 
Hj)= 3 em + 2t.T; + Bzoz + Be ote) (1) 


where 7, and 7, are Pauli operators that act on the orbital charge states of the DQD 
electron, 0, and o, are Pauli operators that act on the spin states of the electron, 
B, = B%'+ BM denotes the total magnetic field along the z axis and 
BY (BM, - BM) /2 is half the magnetic field difference of the DQD in the 
x direction. In the theoretical model, we have assumed that the average magnetic 


field in the x direction satisfies (Bee + By) /2=0, which is a good approximation 
given the geometry of the micromagnet and its alignment with the DQD. We add 
the electric-dipole coupling to the cavity with the Hamiltonian 


Hy=g (ata')r, 


where a and a’ are the photon operators for the cavity. The electric-dipole operator 
can be expressed in the eigenbasis {|n)} of Hp as 


T= S> dnm|n)(m| 


nm=0 


We then write the quantum Langevin equations for the operators a and 
nm = |n) (mI: 


3 
La K — 
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where Ap = wr — w, is the detuning of the driving field frequency (wp=2m/) 
relative to the cavity frequency (w.=27f-) and Pam =Pn — Pm is the population 
difference between levels n and m (p,, can, for example, be assumed to be a 
Boltzmann distribution in thermal equilibrium). This description is equivalent to 
a more general master-equation approach in the weak-driving regime in which 
population changes in the DQD can be neglected. Furthermore, «, and kz are the 
photon decay rates at ports 1 and 2 of the cavity and ain, is the input field of the 
cavity, which we assume to couple through port 1 only (ain2 =0). The quantum 
noise of the DQD Fis neglected in what follows. The super-operator y with matrix 
elements Ynm,n'm' tepresents decoherence processes, including charge relaxation 
and dephasing due to charge noise (these processes also imply some degree of spin 
relaxation and dephasing due to spin-charge hybridization via B’). Our goal is to 
relate the incoming parts ain; of the external field at the ports to the outgoing fields 
Aout,i = [Kid — Gini. The transmission A = dout,2/din,1 (where the overbars 
denote time-averaged expectation values) through the microwave cavity is then 
computed using a rotating-wave approximation to eliminate the explicit time 
dependence in equation (2), by solving the equations for the expected value of these 
operators in the stationary limit (@ and G, ): 


-i KyiK2 


A= 
—Ao ~ in /2 +; baa payaay Ann +jXnn+j 


where X14. j= Fnntj /a are the single-electron partial susceptibilities and dj are 
the dipole-transition matrix elements between DQD states. 

Theoretical models for spin-photon coupling and spin decoherence. Here we 
present a brief derivation of the analytical expressions for the spin-photon coupling 
rate g, and the spin decoherence rate 7. A more extensive discussion of spin-photon 
coupling and spin decoherence specific to our device architecture is presented in 
ref. 41. We focus on the e=0 regime used in Fig. 3b. Accounting for spin-charge 
hybridization due to the field gradient BM, the relevant eigenstates of the DQD 
Hamiltonian in equation (1) are |0) ~|—, |), |1) =cos(@/2)|—, |) + sin(@/2)|+, |), 
|2) =sin(®/2)|—, +) — cos(@/2)|+, |) and |3) ~|+, 1). Here we have introduced 
a mixing angle ® = tan™'[gy.,B™ /(2t. — gjt,B-)]. The dipole-transition matrix 
element for the primarily spin-like transition between |0) and |1) is do, + —sin(@/2) 
and the dipole-transition matrix element for the primarily charge-like transition 
between |0) and |2) is do, + cos(@/2). The transition between |0) and |3) is too high 
in energy (off-resonance) and is therefore excluded from our model. The spin- 
photon coupling rate is g, = g.|do1| = g-|sin(@/2)|, in agreement with previous 
theoretical results****, 

To calculate the effective spin decoherence rate - that arises from charge 
decoherence, we first construct the operators 01 = |0) (1| cos(@/2)a; + sin(®/2)o, 
and 092 =|0) (2| + sin(@/2)o, — cos(®/2)o,. Here o,=|—, |) (—, T| and o,=|-, |) 
(+; || are lowering operators for the electron spin and charge, respectively. 
Assuming that the electron charge states have a constant decoherence rate 
Vc= 1/2 + Yeo Where 7; is the charge relaxation rate and 7, is a dephasing rate due 
to charge noise”, the equations of motion for these operators are 


. of & sin(®) 
601 = J} — sin 5 Oo1+ 002 
. sin(®) (2 
2=% 2 901 — COS 2 %2 
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Combined with charge-photon coupling, the overall equations of motion 
(equation (2)) in a rotating frame with a drive frequency f~ f- assume the form 


ae K ; 
a=iAgoa re + [Fi Gini — ig.(do1901 + do2002) 
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The 6; and 63 terms are defined as 6)/(27) = (E, — Eo)/h — f and 6)/(27) 
= (E — E)/h — f, where E,1,2 correspond to the energy of the |0), |1) and |2) 
state, respectively. Steady-state solutions to the above equations give the 
electric susceptibility of the spin qubit transition y, , = 001/a = g,/(é1 — iy), 
where we have identified a charge-induced spin decoherence rate 
0 = 7.[6asin2(/2) + 6\cos*(@/2)]/52. To account for spin dephasing due to 
fluctuations of the 7°Si nuclear spin bath, we express the total spin decoherence 
rate assuming a Voigt profile: 
2 
1 
T 5 nudlear 


2 


(c) 
+8 no 


1s 


ak 


where Ts nuclear ¥ 1 ps is the electron-spin dephasing time due to nuclear field 
fluctuations!??*, 

When fitting the data in Fig. 3b, we use the experimentally determined values 
of g-/(27) = 40 MHz and 4./(27) = 35 MHz, along with the best-fitting field 
gradient BM = 15 mT. For every f,, the fitted value for B, is adjusted so that the 
spin-qubit frequency (E; — Eo)/h matches the cavity frequency f, exactly. The slight 
discrepancy between theory and experiment for 7, could be due to the frequency 
dependence of +, changes in 7. with Ber or other decoherence mechanisms that 
are not captured by this simple model. To resolve such a discrepancy, a complete 
measurement of 7, as a function of 2t,/h and the external field Bot is needed. 

The complete theory! also allows g,/(27) to be calculated for non-zero values 
of e. Using 2t./h= 7.4 GHz, we estimate g,/h = 2.3 MHz at ¢ = 20 jeV (about 
4.8 GHz), close to the value of g./h = 1.0 MHz measured at this DQD detuning 
(Fig. 3a). 

In this theoretical model, we have ignored Purcell decay of the spin qubit 
through the cavity”’. This is justified because +, at every value of tf, is measured 
with a large spin—cavity detuning A ~ 10g,. The expected Purcell decay rate 
of the spin qubit is Ip/(27) = [eg?/(K?/4 + A?)]/(27) 0.02 MHz, well below 
the measured values of y,/(27). We also note that, at least in the 2, >> Ez limit, 
spin decoherence at ¢ =0 is dominated by noise-induced dephasing rather than 
by energy relaxation. This is because at 2¢,/h = 11.1 GHz the spin decoherence rate 
«/(27) =0.41 MHz corresponds to a coherence time of T) = 0.4 tus < 2T, = 6.4 1s. 
Line shapes of vacuum Rabi splittings. In contrast to charge-photon 
systems’, the two resonance modes in the vacuum Rabi splittings (Fig. 2b, c) 
show slightly unequal widths. This effect can be seen by comparing the observed 
spectrum of DQD1 with the expected behaviour of an equivalent two-level charge 
qubit that is coupled strongly to a cavity, calculated using a master-equation simu- 
lation with thermal photon number my, = 0.02 (black dashed line in Extended Data 
Fig. 7). The unequal widths are unlikely to be a result of a large thermal photon 
number in the cavity, because the transmission spectrum calculated with ny, =0.5 
(orange dashed line) clearly does not fit the experimental data*”. 

Instead, the observed asymmetry probably arises from the dispersive interaction 
between the cavity and the primarily charge-like transition between |0) and |2), 
which results in three-level dynamics that is more complicated than the two-level 
dynamics that characterizes charge-photon systems. A more complete treatment 
of this effect is given in ref. 41. Here we compare the experimental observation 
with theory by calculating A(f)/Ao using g./(27) = 40 MHz (DQD1) or g./(27) 
= 37 MHz (DQD2), 7./(27) = 105 MHz (DQD1) or 7,/(27) = 130 MHz (DQD2), 
«/(2%) = 1.8 MHz, tunnel couplings 2t./h = 7.4 GHz, pM =15mT and 
B,=209.6 mI. The results are shown as black solid lines alongside experimental 
data in Extended Data Fig. 7. The agreement between experiment and theory is 
very good for both devices. In particular, the asymmetry between the vacuum Rabi 
modes is also seen in the theoretical calculations. The larger values of 7, used in 
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the theoretical calculations may again be due to the frequency dependence of 7. 
or to changes in 7, with BS’. Further experiments are needed to resolve this 
difference. 

Prospects for long-range spin-spin coupling. The coherent spin—photon inter- 
face may be readily applied to enable spin-spin coupling across the cavity bus. Here 
we evaluate two possible schemes for implementing such a coupling, both of which 
have been demonstrated with superconducting qubits**“”, The first approach uses 
direct photon exchange to perform quantum-state transfer between two qubits’”. 
The transfer protocol starts by tuning qubit 1 into resonance with the unpopulated 
cavity for a time 1/(4g,), at the end of which the state of qubit 1 is transferred com- 
pletely to the cavity. Qubit 1 is then detuned rapidly from the cavity and qubit 2 is 
brought into resonance with the cavity for a time 1/(4g,), at the end of which the 
state of qubit 1 is transferred completely to qubit 2. Therefore, the time required 
for quantum-state transfer across the cavity is 1/(2g,). Because the decay of vacuum 
Rabi oscillations occurs at a rate «/2 + 7, the threshold for coherent-state transfer 
between two spin qubits is 2g,/(K/2 + 7s) > 1. The ratio 2g,/(K/2 + 7.) is plotted as 
a function of 2t,/h in Extended Data Fig. 8a. It can be seen that 2g,/(«/2 +.) >1 
for all values of 2t,/h, indicating that spin-spin coupling via real photon exchange 
is achievable and may be implemented at any value of f,. For our device parameters, 
the regime 2t,/h 6 GHz, in which spin-charge hybridization is large and the 
ratio 2g,/(«/2 + ys) reaches a maximum of 3.5, seems most advantageous for such 
a coupling scheme. 

The second approach to spin-spin coupling uses virtual photon exchange*®. In 
this scheme, both spin qubits would operate in the dispersive regime, with an 
effective coupling rate of J = ge(l /A, + 1/A2)/2, where A, and A; are the qubit- 
cavity detunings for qubits 1 and 2, respectively. Assuming that both qubits operate 
with an equal detuning A;.= 10g, to minimize Purcell decay, J= g,/10. For 
coherent spin-spin interaction, J >, needs to be satisfied, leading to the condition 
&/Vs > 10. In Extended Data Fig. 8b, we plot the ratio g,/7, as a function of 2t,/h, 
observing a maximum of g,/7, 4 at 2t./h= 10 GHz. Because the dominant spin 
mechanism is probably hyperfine-induced dephasing by the ”°Si nuclei in this 
regime (the decoherence rate +,/(2%) + 0.4 MHz is close to the decoherence rates 
commonly found with single-spin qubits in natural Si; ref. 38), transitioning to 
isotopically purified 78Si host materials is likely to lead to an order-of-magnitude 
reduction in ¥,/(27), as demonstrated recently*®. Such an improvement will allow 
virtual-photon-mediated spin-spin coupling to be implemented in our device 
architecture as well. 

Last, we note that both coupling approaches will benefit substantially from 
larger values of the charge-photon coupling rate g., which is achievable through 
the development of higher-impedance cavities””°’. The superconducting cavity 
used here is estimated to have an impedance between 200 (2 and 300 2. Increasing 
this value to about 2 kQ., which is possible by using NbTiN as the superconducting 
material, will lead to another factor-of-three increase in g. and therefore g,. This 
could enable the g,/, > 100 regime to be accessed, where high-fidelity two-qubit 
gates can be implemented between distant spins. Improvements in the fidelity of 
cavity-mediated two-qubit gates, particularly in the case of real photon exchange, 
can also be sought by improving the quality factor of the cavity (and thereby 
reducing «). This is achievable by implementing stronger gate line filters’? and 
removing lossy dielectrics such as the atomic-layer-deposited Al,O3 underneath 
the cavity. 

Data availability. The data that support the findings of this study are available 
from the corresponding author on reasonable request. Source Data for Figs 1-4 
and Extended Data Figs 2-8 are available with the online version of the paper. 
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Extended Data Figure 1 | Micromagnet design. To-scale drawing of the 
micromagnet design, superimposed on top of the SEM image of the DQD. 
The coordinate axes and the direction of the externally applied magnetic 
field BS are indicated at the bottom. In this geometry, the DQD electron 
experiences a homogeneous z field B, ~ BS + BM. The total x field B, that 


is experienced by the electron is spatially dependent, being approximately 
By (Be) when the electron is in the L (R) dot (|e| > f,) and 

(BM, + BM) /2 when the electron is delocalized between the DQDs (¢ = 0). 
The y field B, for the DQD electron is expected to be small compared to 
the other field components for this magnet design. 
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Extended Data Figure 2 | Photon number calibration. The ESR 
resonance frequency fgsr, measured using the phase response of the 
cavity A¢@ in the dispersive regime (Fig. 4b), is plotted as a function 
of the estimated power at the input port of the cavity P (data). The 
device is configured with g,/(27) = 2.4 MHz and spin-photon 
detuning A/(27) + —18 MHz. The dashed line shows a fit to 
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Sesp =Sesp (P = 0) + (2npng?/A)/(2m), where Mpp is the average number 
of photons in the cavity, plotted as the top x axis. The experiments are 
conducted with P~ —133 dBm (0.05 fW), which corresponds to mph ~ 0.6. 


The error bars indicate the uncertainties in the centre frequency of the 
ESR transition. 
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Extended Data Figure 3 | DQD stability diagrams. The cavity on the basis of these measurements and subsequently tuned close to 
transmission amplitude A/Ap (a, c) and phase response Ad (b, d) are resonance with the cavity for the experiments described in the main text. 
plotted as functions of Vp; and Vp; for DQD1 (a, b) and DQD2 (c, d), The red circles indicate the locations of the (1, 0) < (0, 1) transitions of 
obtained with f=f.. The (1, 0) < (0, 1) transitions are clearly identified the two DQDs. 
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Extended Data Figure 4 | Spin decoherence rates at different DQD and 7,/(27) is determined as the half-width at half-maximum of each 
tunnel couplings. ESR line, as measured in the cavity phase response Lorentzian. The spin-photon detuning | A|~10 g, for each dataset, to 
Ao(f,); is shown for different values of 2t./h in the low-power limit (data). ensure that the system is in the dispersive regime. 


€ =0 for every dataset. Dashed lines are fits with Lorentzian functions 
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Extended Data Figure 5 | Spin-photon coupling strengths at different 
DQD tunnel couplings. a, b, Vacuum Rabi splittings for 2t,/h < fc (a) 
and 2t,/h > f- (b), obtained by varying BS“ until a pair of resonance peaks 
with approximately equal heights emerges in the cavity transmission 
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spectrum A/Ap. g,/(2T) is then estimated as half the frequency difference 
between the two peaks. ¢ = 0 for every dataset. g, is difficult to measure for 
5.2 GHz < 2t./h < 6.7 GHz owing to the small values of A/Ap that arise 
from the large spin decoherence rates 7, in this regime. 
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Extended Data Figure 6 | Spin relaxation at ¢ = 0. The time-averaged using the function 9 + ¢1(T)/Tm)[1 — exp(—Tm/T})], which yields a spin 
phase response of the cavity Ad is shown as a function of wait time Ty relaxation time of T, + 3.2 1s. The experimental conditions are the same as 
(data), measured using the pulse sequence illustrated in Fig. 4c. The for Fig. 4d. 


microwave burst time is fixed at 7, = 80 ns. The dashed line shows a fit 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


5:82 5.84 5.86 5.B2 5.84 5.86 
f (GHz) f (GHz) 


Extended Data Figure 7 | Theoretical fits to vacuum Rabi splittings. BM = (Bea _ BM) /2 = 15 mT and 2t,/h=7.4 GHz for DQD1 (DQD2). 
The calculated cavity transmission spectra (black solid lines) are For comparison, A(f)/Ao, simulated for a two-level charge qubit with 
superimposed on the experimentally measured vacuum Rabi a decoherence rate of 7./(21) = 2.4 MHz coupled to a cavity with 
splittings shown in Fig. 2b, c (data). The calculations are produced (2m) = 1.8 MHz at a rate g./(2n) = 5.5 MHz, is shown in a for 

with g./(27) = 40 MHz (g./(27) = 37 MHz), Ki(2n) = 1.8 MHz, thermal photon numbers of n,, = 0.02 (black dashed line) and 

Yl (2%) = 105 MHz (¥./(27) = 120 MHz), B, = BY + B, = 209 mT, nth = 0.5 (red dashed line). 
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Extended Data Figure 8 | Prospect for long-range spin-spin coupling. a, The ratio 2g,/(«/2 + 7.) as a function of 2t,/h, calculated using the data in 
Fig. 3b and «/(27) = 1.8 MHz. b, The ratio g,/7,; as a function of 2t,/h, also calculated using the data in Fig. 3b. 
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Planning chemical syntheses with deep 
neural networks and symbolic Al 


Marwin H. S. Segler!?, Mike Preuss? & Mark P. Waller* 


To plan the syntheses of small organic molecules, chemists use retrosynthesis, a problem-solving technique in which 
target molecules are recursively transformed into increasingly simpler precursors. Computer-aided retrosynthesis 
would be a valuable tool but at present it is slow and provides results of unsatisfactory quality. Here we use Monte Carlo 
tree search and symbolic artificial intelligence (AI) to discover retrosynthetic routes. We combined Monte Carlo tree 
search with an expansion policy network that guides the search, and a filter network to pre-select the most promising 
retrosynthetic steps. These deep neural networks were trained on essentially all reactions ever published in organic 
chemistry. Our system solves for almost twice as many molecules, thirty times faster than the traditional computer-aided 
search method, which is based on extracted rules and hand-designed heuristics. In a double-blind AB test, chemists on 
average considered our computer-generated routes to be equivalent to reported literature routes. 


Retrosynthetic analysis is the canonical technique used to plan the 
synthesis of small organic molecules’”. In retrosynthesis, a search tree 
is built by ‘working backwards, analysing molecules recursively and 
transforming them into simpler precursors until one obtains a set of 
known or commercially available building-block molecules (Fig. 1)**. 
Given that transformations are formally reversed chemical reactions, 
the plan can be then carried out in the laboratory in the forward direc- 
tion to synthesize the target compound™, Transformations are derived 
from successfully conducted series of similar reactions with analogous 
starting materials, and are often named after their discoverers (‘named 
reactions)”. At each retrosynthetic step, a small set out of hundreds of 
thousands of transformations known in modern chemistry has to be 
selected. Ina pattern recognition process, chemists intuitively prioritise 
the most promising transformations, which they then consider, with- 
out actively thinking about the less promising ones®. However, when 
a transformation is applied to a new molecule, there is no guarantee 
that the corresponding reaction will proceed in the expected way’. 
A molecule failing to react as predicted is called ‘out of scope’. This 
can be due to steric or electronic effects, an incomplete understanding 
of the reaction mechanism, or conflicting reactivity in the molecular 
context. Predicting which molecules are ‘in scope’ can be challenging 
even for the best human chemists*’. 

Computer-assisted synthesis planning (CASP) could help chemists 
to find better routes faster, and is a missing component in virtual 
de novo design and robot systems performing molecular design- 
synthesistest cycles*!°. To perform CASP, the knowledge that humans 
gain must be transferred into an executable program'!~'°. Despite 
60 years of research, attempts to formalize chemistry by manual encod- 
ing by experts have not convinced synthetic chemists, and it does not 
scale to exponentially growing knowledge’*"'*. Methods of algorith- 
mically extracting transformations from reaction datasets” ~” have 
been criticized for high noise and lack of ‘chemical intelligence’ !>"". 
However, we recently showed that deep neural networks can learn to 
rank extracted symbolic transformations, and to avoid reactivity con- 
flicts, which mimics the expert's intuitive decision-making”’. To guide 
the search in promising directions, heuristic best first search (BFS) has 
been employed, in which hand-designed heuristic functions determine 


position values), Unfortunately, unlike in chess, it is difficult to define 
strong heuristics in chemistry for three reasons. First, chemists tend 
to disagree on what constitutes a good position”*”°. Second, although 
it is generally desirable to simplify the molecules, it can be tactically 
beneficial to temporarily increase complexity by the use of protecting 
or directing groups. Finally, the position value depends highly on the 
availability of suitable precursors'*'>. Even complex molecules can be 
made in a few steps if precursors are readily available. Therefore, one 
cannot reliably estimate the value of a synthetic position without com- 
pletely ‘playing the molecules until the end of the game’ 

Monte Carlo tree search (MCTS) has emerged as a general search 
technique for sequential decision problems with large branching 
factors without strong heuristics, such as games or automated theo- 
rem proving”® 8, MCTS uses rollouts to determine position values. 
Rollouts are Monte Carlo simulations, in which random search steps 
are performed without branching until a solution has been found or 
a maximum depth is reached. These random steps can be sampled 
from machine-learned policies p(t|s)”’, which predict the probability 
of taking the move (applying the transformation) ¢ in position s, and 
are trained to predict the winning move by using human games or 
self-play*?*. 

In this work, we combine three different neural networks together 
with MCTS to perform chemical synthesis planning (3N-MCTS). 
The first neural network (the expansion policy) guides the search in 
promising directions by proposing a restricted number of automati- 
cally extracted transformations. A second neural network then predicts 
whether the proposed reactions are actually feasible (in scope). Finally, 
to estimate the position value, transformations are sampled from a third 
neural network during the rollout phase. The neural networks were 
trained on essentially all reactions published in the history of organic 
chemistry. 


Training the expansion and rollout policies 

We extracted transformation rules from 12.4 million single-step reac- 
tions from the Reaxys*° chemistry database”’. Two sets of rules were 
extracted. The rollout set comprises rules that contain the atoms and 
bonds that changed in the course of the reaction (the reaction centre), 


'Mnstitute of Organic Chemistry and Center for Multiscale Theory and Computation, Westfalische Wilhelms-Universitat, Munster, Germany. @BenevolentAl, London, UK. ?European Research Center 
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Figure 1 | Translation of the traditional chemists’ retrosynthetic route 
representation to the search tree representation. a, The traditional 
chemists’ retrosynthetic route representation (conditions omitted)*’. 

b, The search tree representation. The nodes in the tree represent the 
synthetic position, and contain all precursors needed to make the 
molecules of the preceding positions all the way down to the tree’s 


and the first-degree neighbouring atoms. Only rules that occurred at 
least 50 times in reactions published before 2015 were kept. For the 
expansion rules, a more general rule definition was employed. Here, only 
the reaction centre was extracted. Rules occurring at least three times 
were kept. The two sets encompass 17,134 and 301,671 rules, and cover 
52% and 79% of all chemical reactions from 2015 and after, respectively. 

Rule extraction associates each reaction, and thus each product, 
with a transformation rule. This allows us to train neural networks 
as policies to predict the best transformations given the product, or 
in other words, the best reactions with which to make the product”’, 
Importantly, such neural networks also learn about the context in which 
the reactions can occur (functional group tolerance)’. For the expan- 
sion policy, we employed a deep highway network*’ with exponential 
linear unit nonlinearities**. To assess its ability to generalize, we per- 
formed a time-split strategy~’. For training, only reactions published 
before 2015 were used, whereas for validation and testing, data from 
2015 and later were selected. 

Extended Data Table 1 shows the metrics for the expansion pol- 
icy. The neural network predicts the correct solution out of 301,671 
transformations with an accuracy of 31%, which is reasonable. It has 
to be noted that there are almost always many feasible ways to make a 
molecule. The top 10 and top 50 accuracies of 63.3% and 72.5% indicate 
that the correct transformations are generally ranked highly. Beyond 
the top 50 predicted results, the accuracy increases only marginally. 
This observation allows us to reduce the branching factor drastically, 
which is 46,175 when rules are applied exhaustively. During search 
tree expansion, we restrict the possible transformations to a maximum 
of 50. Additionally, we sum the probabilities of the predicted actions, 
starting from the highest-ranked transformation. When the cumula- 
tive probability reaches 0.995, we stop further expansion, even if fewer 
than 50 actions have been expanded. This allows the system to focus 
on highly likely transformations when only a few good options exist, 
for example in the synthesis of acyl chlorides or Grignard reagents. We 
observed that the reactions in this reduced top 50 are almost always 
reasonable and are often variations of the correct prediction. For exam- 
ple, a Heck reaction can often be conducted with bromide, iodide or 
triflate as the leaving group. 

The rollout policy network, which is a neural network with one hid- 
den layer, is trained in the same way as the expansion policy. It uses a set 
of 17,134 rules, which implies a lower coverage than the expansion pol- 
icy, yet it needs just 10 ms to make a prediction, in contrast to 90 ms for 
the expansion policy, owing to the smaller output layer (see Extended 
Data Table 1). The rationale for using two different rule sets is to use a 
powerful but slow policy to select the best candidate transformations 
for expansion, and a fast rollout policy to estimate the position values*. 
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root, which contains the target. Branches in the search tree correspond 
to complete routes. Calculating the value of branches through task- 
dependent scoring functions allows us to compare and rank different 
routes. The target molecule can be solved if it can be deconstructed to a 
set of readily available building blocks (marked red). Ph, phenyl; Boc, 
tert-butyloxycarbony]l; TBS, tert-butyldimethylsilyl. 


Prediction with the in-scope filter network 

After the search space has been narrowed down by the expansion policy 
to the most promising transformations, we need to predict whether 
the corresponding reactions will actually work for a particular mole- 
cule. We trained a deep neural network as a binary classifier to predict 
whether the reactions corresponding to the transformations selected by 
the policy network are actually feasible”. The classifier has to be trained 
on successful and failed reactions. Unfortunately, failed reactions are 
rarely reported and not contained in reaction databases. However, 
published reactions contain implicit information about reactions that 
do not occur. For a high-yielding reaction A + B — C, we can assume 
that hypothetical products D, E,... are not formed. By applying reaction 
rules in the forward direction to the reactants of reported reactions, 
negative reactions, for example, with incorrect regio- and chemoselec- 
tivity, can then be generated*!”. Here we used the same rule set as for 
the expansion policy. Additionally, we generated negative examples by 
shuffling the associated pairs of products and corresponding reactions 
(see Methods for details). Using these data augmentation strategies, 
we generated 100 million negative reactions from reactions published 
before 2015 for training and 10 million published in and after 2015 
for testing. As positive cases, all reported reactions from these periods 
were used. On the test set, the classifier achieves an area under the 
receiver operation characteristic curve of 0.99, and an area under the 
precision-recall curve of 0.94, which indicates good performance (see 
Extended Data Fig. 1)*°. The false positive rate of the filter (that is, 
incorrect reactions passing the filter) is 1.5%, whereas the false negative 
rate (that is, real reactions being filtered out) is 14%. Interestingly, the 
in-scope filter correlates with basic electronic properties (Hammond 
parameters and lowest unoccupied molecular orbital (LUMO) ener- 
gies), even though it is not explicitly trained to do so (see Methods and 
Extended Data Fig. 5). 


Integrating neural networks and MCTS 

The expansion policy network and the in-scope filter network are com- 
bined into a pipeline (Fig. 2b). When a position s; is to be analysed, each 
molecule of the position is fed into the policy network. Then, the trans- 
formations with the highest scores are applied to the molecule, which 
yields the possible precursors and thus full reactions. These reactions 
are submitted to the in-scope filter, where only transformations and 
precursors corresponding to positively classified reactions are kept. 
They represent the ‘legal moves’ available in position s;. 

The expansion procedure and the rollout policy are then 
incorporated in the respective phases of an MCTS algorithm to 
form 3N-MCTS. The four MCTS phases are then iterated to build 
the search tree. 
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Figure 2 | Schematic of MCTS methodology. a, MCTS searches by 
iterating over four phases. In the selection phase (1), the most urgent 
node for analysis is chosen on the basis of the current position values. 
In phase (2) this node may be expanded by processing the molecules of 
the position A with the expansion procedure (b), which leads to new 
positions B and C, which are added to the tree. Then, the most promising 
new position is chosen, and a rollout phase (3) is performed by randomly 
sampling transformations from the rollout policy until all molecules 
are solved or a certain depth is exceeded. In the update phase (4), the 
position values are updated in the current branch to reflect the result of the 


(1) Selection. In the first 3N-MCTS phase, starting at the root node 
(the target molecule) of the search tree, the algorithm sequentially 
selects the most promising next position within the tree until a leaf 
node is reached (Fig. 2a). The algorithm balances the selection of high- 
value positions and unexplored positions. If a leaf node is visited for 
the first time, it is directly evaluated by a rollout. If it is visited for the 
second time, it is expanded by processing via the expansion policy. 

(2) Expansion. Now, the possible transformations determining the 
follow-up positions of the current position are selected by applying the 
expansion procedure. The predicted follow-up positions are added to 
the tree as children of the leaf node, and the most promising position 
is selected for rollout. 

(3) Rollout. This phase starts with checking the status of the position. 
If it is already solved, the algorithm directly receives a reward greater 
than 1 to encourage exploitation. Non-terminal states are subjected to 
a rollout, where actions are sampled from the rollout network recur- 
sively, until the state has been deconstructed into building blocks or a 
maximal depth is reached. 

(4) Update. If a solution has been found during rollout, a reward 
of 1 is received. Partial rewards are given if some, but not all, mole- 
cules in the state are solved. If no solution was found, a reward of -1 
is received. Here, bespoke scoring functions for the problem at hand, 
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rollout. b, Expansion procedure. First, the molecule (A) to retroanalyse is 
converted to a fingerprint and fed into the policy network, which returns a 
probability distribution over all possible transformations (T; to T,,). Then, 
only the k most probable transformations are applied to molecule A. This 
yields the reactants necessary to make A, and thus complete reactions Ry 

to Ry. For each reaction, the reaction prediction is performed using the 
in-scope filter, returning a probablity score. Improbable reactions are then 
filtered out, which leads to the list of admissible actions and corresponding 
precursor positions B and C. 


such as process chemistry or small-scale medicinal chemistry, can also 
be supplied. Eventually, the tree is updated to incorporate the achieved 
reward by updating the position values. 

These four phases of 3N-MCTS are iterated until a time budget or 
maximal iteration count is exceeded. Finally, to obtain the synthesis 
plan, we repeatedly select the retrosynthetic step with the highest 
value until a solved position is reached, or a maximum depth has been 
exceeded, in which case the problem is unsolved. 


Evaluating the performance characteristics of 3N-MCTS 
To evaluate the performance of 3N-MCTS, we compare our algorithm 
to the state-of-the-art search method, which is BFS with the hand- 
coded SMILES*” heuristic cost function (‘heuristic BFS’)°. This func- 
tion assigns the lowest cost to steps that split up the molecule into 
equally sized parts. Additionally, we perform BFS with the cost cal- 
culated by the policy network (‘neural BFS’). All algorithms use the 
same set of automatically extracted transformations. The evaluation is 
again time-split, as follows. Models were trained only on data published 
before 2015. As test data, only molecules first reported in or after 2015 
were considered (which were not contained in the training dataset). 
Provided with the target molecules, the algorithms then had to find a 
synthesis route to given building blocks. 
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Figure 3 | An exemplary six-step synthesis route for an intermediate in 
a drug candidate synthesis. This route is identical to the published one** 
and was found by our algorithm autonomously within 5.4 s. The affected 
functional groups in each step are marked blue or red. 


Figure 3 shows an exemplary six-step route for an intermediate of 
a drug candidate synthesis reported in 2015, which was found by our 
algorithm in 5.4s. It matches the published route. Several hundred 
additional exemplary retrosynthetic routes found by the MCTS algo- 
rithm for molecules first synthesized in or after 2015 are deposited in 
Supplementary Information (see also Extended Data Fig. 2). 


Quantitative evaluation 

Surprisingly, in the past, neither hand-coded nor automatically extracted 
retrosynthetic systems have been validated at scale in a statistical way. 
We quantitatively assessed the performance characteristics of the differ- 
ent search algorithms by finding synthesis routes for 497 diverse mole- 
cules first reported in or after 2015 to known building blocks (see Fig. 4). 

MCTS already solves more than 80% of the test set with a time limit 
of 5s per target molecule, compared to 40% with neural BFS and 0% 
for heuristic BFS. MCTS solved 92% of the test set with a limit of 60s 
per molecule, whereas neural BFS solved 71%, and heuristic BFS solved 
4%, Even at much longer runtimes of 20 min per molecule, heuristic 
and neural BFS are not able to compete with MCTS. Provided with 
infinite runtime, however, the algorithms will converge to the same 
performance. The molecules that MCTS failed to solve could not be 
solved by the BFS algorithms either. When looking beyond the first 
(top 1) retrieved route, MCTS and BFS find similar alternative routes, 
and do not differ much in terms of route diversity (see Supplementary 
Information section 2). 

To determine which MCTS components are responsible for its 
superior performance, we compared MCTS against several related 
search algorithms (see Table 1) at a runtime limit of 3 x 300s (three 
restarts). MCTS in conjunction with the expansion policy network 
solved the highest number of retrosynthetic targets. On average, 
MCTS required the least amount of time per molecule to find a 
solution (entry 1). Plain Monte Carlo search randomly selects trans- 
formations using the expansion policy network, without building a 
tree. The Monte Carlo search (entry 2) solved 89.54% of the test set. 
UCT is an MCTS variant that uses the expansion policy network only 
to narrow down the possible transformations, but not to guide the 
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Figure 4 | Influence of the time per query on performance. The 
maximum number of steps is 100,000. 


search via the predicted probability of the transformation”®. In this 
way, 87.12% of the test set is solved. BFS using a cost function based 
on the expansion policy network solved only 84.24%, highlighting the 
importance of rollouts. The traditional approach, BFS with a hand-de- 
signed heuristic cost function, solves only 45.6% of the test set, and 
needs 433.4 s on average to find a solution. These results suggest that 
all components of 3N-MCTS (building a tree, reducing the branching 
factor via the expansion policy, guiding the search with the expansion 
policy, and using rollouts) contribute to its superior performance. We 
also found 3N-MCTS to be robust towards the choice of the MCTS 
parameters (see Supplementary Information section 1). 


Assessing route quality via double-blind AB tests 

The central criticism of retrosynthesis systems has been that the pro- 
posed routes often contain what chemists immediately recognize 
as chemically unreasonable steps. Therefore, to assess the quality of 
the solutions we conducted two AB tests, in which 45 graduate-level 
organic chemists from two world-leading organic chemistry institutes 
in China and Germany had to choose one of two routes leading to 
the same molecule on the basis of personal preference and synthetic 
plausibility. The tests were double-blind, meaning that neither the 
participants nor the conductors were aware of the origin of the routes. 
The test molecules were selected randomly from a set of drug-like com- 
pounds first published in or after 2015 (see Supplementary Information 
for the entire list of the targets and routes). 

In the first test, the participants had the choice between a route 
reported by expert chemists in the literature, and a route generated 
by our 3N-MCTS algorithm for the same target molecule. Routes to 
nine different target molecules were offered. Routes towards the same 
molecule were required to have the same number of steps. 

Here, one might expect the participants to clearly identify the routes 
suggested by the machine as inferior. Surprisingly, this is not the case. 
We found that the experts did not significantly (P= 0.26) prefer the 
literature route (43.0%) over our program’s route (57.0%). Figure 5a 
shows the preference ratios for the individual routes. Here, the prefer- 
ence is generally balanced, with a slight trend towards MCTS. In some 
cases, the participants have clear preferences (see Fig. 5c and Extended 
Data Fig. 3 for examples where MCTS was not preferred). 


Table 1 | Experimental results 


Search Percentage Time (seconds 
Entry method Policy* solved per molecule) 
1 MCTS Neural 95.24+0.09 13.0 
2 MC Neural 89.54+0.59 275.7 
3 UCT Neural 87.1240.29 30.4 
4 BFS Neural 84.24+0.09 39.1 
5 BFS SMILES9/2 55.53 + 2.02 422.1 
The time budget was 300 s and 100,000 iterations for MCTS or 300 s and 100,000 expansions 
for BFS, per molecule. Three restarts were carried out. 
*In the BFS, this is the cost function. 
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Figure 5 | Double-blind AB testing of MCTS-derived routes against 
literature and BFS routes. a, Chemists did not significantly prefer 
literature routes over routes found by MCTS (Wilcoxon signed-rank test, 
P=0.26). b, Chemists significantly prefer routes found by 3N-MCTS 

over routes generated by heuristic BFS without a policy network and an 
in-scope filter (Wilcoxon signed-rank test, P=0.01). A ratio above 0.5 
indicates that more than 50% of participants preferred the MCTS solution. 
c, In this AB example (task 1 of test a), the chemists preferred the literature 
route proceeding via a Grignard reaction, in contrast to the MCTS route, 


In the second test, the participants had to report their prefer- 
ences for either routes found by 3N-MCTS or routes generated by 
a baseline system, which uses heuristic BFS and the same transfor- 
mation rules as 3N-MCTS. However, it lacks a policy network to 
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which was proposed to proceed via Seyferth-Gilbert homologation with 
the Ohira-Bestmann reagent. Although the MCTS route is chemically 
reasonable, it uses less-conventional chemistry in this case. The 
subsequent key steps to build the annulated cycle are the same for both 
MCTS and the literature. d, Without applying the expansion policy and 
the in-scope filter to select the best reactions, heuristic BFS produces the 
typical errors traditionally criticized in retrosynthetic systems. That is, the 
expert system tries to apply rules that are overgeneral and will not work in 
this molecular context. 


preselect promising transformations and an in-scope filter to exclude 
unlikely steps. Here, the participants significantly (P = 0.01) pre- 
ferred the routes generated by the MCTS algorithm (68.2%) over 
the baseline system (31.8%). We attribute the preference towards the 
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3N-MCTS-generated routes to lower frequencies of unreasonable 
steps (see Fig. 5b and d). 


Discussion 

We have shown that MCTS combined with deep neural networks and 
symbolic rules can be used effectively to perform chemical synthesis 
planning. In contrast to earlier work, our purely data-driven approach 
can be initially set up within a few days without the need for tedious 
and biased expert encoding or curation, and is applicable to disci- 
pline-scale datasets. Our approach solves more problems and is faster 
than established search methods. Furthermore, it also performs better 
qualitatively. In the past, retrosynthetic systems have been criticized for 
producing more noise than signal. We observed that traditional heuris- 
tic BFS without neural network guiding did lead to many unreasonable 
steps being proposed in the routes, while the 3N-MCTS approach pro- 
posed more reasonable routes. This is supported by double-blind AB 
experiments, where the participating organic chemists showed clear 
preference towards 3N-MCTS over the traditional approach. Finally, 
our double-blind AB tests suggest that, for the first time, organic chem- 
ists should consider the quality of retrosynthetic routes generated by 
a machine to be on par with reported routes for molecules of practical 
relevance. 


Limitations and frontiers 

Nevertheless, it would be premature to consider computer-aided syn- 
thesis a solved problem, as challenges remain. First, natural product 
synthesis is currently beyond the capabilities of our method. The spar- 
sity of the training data in this area remains a fundamental challenge for 
deep learning approaches*®. However, natural products are also chal- 
lenging for the best human chemists, as they can behave unpredictably, 
and often require intense methodology development*®. Natural product 
synthesis may be solvable by stronger, but slower-reasoning, algorithms 
that could be used to invent reactions*"*”, 

Another important challenge is the reliable prediction of stereo- 
chemical outcomes. While our approach is able to treat stereo- 
information, the most important part, predicting enantiomeric or 
diastereomeric ratios quantitatively, remains an open challenge. 
Convincing global approaches for the quantitative prediction of 
enantiomeric or diastereomeric ratios over a wide range of differ- 
ent reactions without recourse to expensive quantum-mechanical 
calculations** have not been reported. However, they could be 
addressed with stereochemistry-aware descriptors. Furthermore, 
our system currently does not take reaction mechanisms, equilibria 
between different forms, such as tautomers, or three-dimensional 
structures into account, which can be crucial in natural product 
synthesis. Also, we do not at present perform reaction condition 
prediction”. 


Outlook 

For the past 60 years, experts have been trying to dictate the rules of 
chemistry to computers via hand-coded heuristics. Instead, we antici- 
pate that equipping machines with strong, general planning algorithms, 
symbolic representations, and the means to learn autonomously from 
the rich history of chemistry will be crucial to allowing the machine to 
become accepted as a valuable assistant in chemical synthesis, which 
is central to solving humanity's most pressing problems in agriculture, 
healthcare and material science. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Chemistry. Molecules are stored in the search tree as canonical SMILES strings. If 
stereoinformation is desired, molecules are stored as canonical, isomeric SMILES. 
For processing, molecules are translated from SMILES into molecular graphs, 
which are vertex-labelled and edge-labelled graphs m= (A, B), with atoms a € A 
as vertices and bonds b € Bas edges. Retrosynthetic transformation rules are pro- 
ductions on graphs*!. In chemical terminology, transformations are also referred 
to as ‘named reactions. The Chemistry Development Kit (CDK)** and RDKit®* 
chemoinformatics libraries were used for the implementation. 

Retrosynthesis as a Markov decision process. Markov decision processes (MDPs) 
model sequential decision processes of an agent in an environment”’. An MDP is 
atuple(S, A, T, R), with states (positions) s € S, actions (transformations) a € A, 
a transition model [(s, a, s’) determining the probability Pr(s’|a, s) of reaching state 
s' when taking action a in state s, and a reward function R(s, a, s’), which returns 
the reward when transitioning to s’ via action a in state s. A policy 7(a|s) is a prob- 
ability distribution over all actions given state s. 

Unlike in games, such as chess or Go, where it is trivial to write down the ground 
truth rules (the model) of the game, querying the ‘chemical environment to find 
out whether an action actually leads to the desired successor state is expensive. 
Either a wet-laboratory experiment has to be conducted or a quantum-chemical 
calculation on a high level of theory has to be run, which usually takes longer than 
running the laboratory experiment. Learning from millions of episodes of self-play 
can therefore not be employed. 

To avoid these expensive interactions, we therefore need to learn or construct 
a model of the environment to perform planning. As elaborated in the introduc- 
tion, this model will be inaccurate?***. Even the best human chemists’ predictions 
can and do fail, which implies that humans also perform synthesis planning with 
inaccurate mental models of the chemical environment. Here, we use automatic 
rule extraction to determine the action set (the transformations) and the in-scope 
filter network to learn a transition model (which is applied in a binary way). The 
expansion policy network serves as a prior policy. 

In this Article, a state (position) s € S is a set of molecules s= {m;, Mj, ...}. The 
initial state so = {mmo} contains only the target molecule mp. Actions are then trans- 
formations (rules or productions on graphs*!) applied to one of the molecules 
m ina state s. When applying a legal action aj to a state s;= {mg, mp, me}, it will 
produce a new state, for example, s;= {mg, me, mp, m<}. 

Given a set of building-block molecules B (specified before the start of the 
search), state s; is solved if all molecules m; in s; are building blocks m; € B. A state 
is terminal either if it is solved or if no legal actions are available. Ideally, it should 
be possible to provide the set of building blocks 6 dynamically before the search 
is started. Otherwise, the system could not adapt, for example, when a building 
block runs out of stock. Also, a researcher may choose different sets of building 
blocks for each search, for example, by first trying to find the solution to a problem 
with molecules that are in stock in the laboratory, and afterwards considering 
additional molecules from chemical suppliers. This makes it challenging to define 
or learn value functions, because a changed set of building blocks changes the 
terminality of states and the reward function, which entails a change of the value 
function’. A further challenge is that the initial state (the target molecule) changes. 
In most games, the reward function is always the same (the rules never change, 
and a terminal state is always terminal). 

The size of the state space can only be roughly estimated. The number of drug- 
like molecules, which contain a restricted set of elements and functional groups, 
might already exceed 10° molecules*. However, this number excludes synthetic 
intermediates and organometallic and organo-main group chemistry, which add 
orders of magnitude to the state space size. The action space is formed by the trans- 
formations available to the system, and the legal actions are those actions that can 
be applied to the molecules in a state via subgraph isomorphism. Unlike for other 
game artificial intelligence problems, in retrosynthesis we can limit the depth of 
the tree to a relatively small number (here, 25) and abort the simulation as failed if 
a viable route cannot be found within this limit. Our trees are thus wider and less 
deep than for other applications. 

MCTS. MCTS is a reinforcement learning approach that combines tree search 
with learning from simulated episodes of experience, which are obtained from 
interacting with a model of the environment”*”’. MCTS has been successfully 
applied in problems of sequential decision making in many domains, such as games 
or automated theorem proving”**°, MCTS has several desirable features, which 
makes it particularly well suited for retrosynthesis. It allows the calculation of value 
functions focused on a particular initial state on the fly*4, and therefore does not 
depend strongly on heuristics. Each edge (s, a) in the search tree stores the action 
value Q(s, a), the visit count N(s, a) and a prior probability P(s, a) received from 
the expansion policy network. 

Selection. In the first MCTS phase, starting at the root node, the tree policy 
(equation (1)) is used to select actions. The simulation descends the tree step by 
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step. At each step tf, the next action a; is selected from all available actions A(s;) 
in s; by equation (1), where N(s;_}, a—1) is the visit count of the state-action pair 
that led to the current state, and c the exploration constant. 


a; = argmax Asia) + cP(st, a) NNGst-v ar) (1) 
acA(s;) N(s;; 4) 1+ N(s;, a) 


The inclusion of the prior probability P(s, a) in the second term of equation (1) 
allows the system to explore the most promising lines of analysis first®”. With 
repeated visits this term decays, allowing for the exploration of other options. 
Additionally, this allows one to take into account the confidence in the evaluation 
obtained via the rollout, which is expected to be noisy. 

The tree policy is applied until a leaf node or a terminal node is found. If a leaf 

node is visited for the second time, it is expanded. Then, all non-building-block 
molecules m; € s; are processed by means of the expansion procedure. The resulting 
state-action pairs are added to the tree as children of the leaf node, and the most 
probable action according to the policy network is selected for rollout. 
Expansion. During expansion, the state is processed once via the expansion pro- 
cedure, and the reduced top 50 successor states are directly added to the tree. This 
trick can be applied because retrosynthesis is a single-player game, and we do not 
have to fear overlooking ‘killer’ moves (trap moves in which a small mistake will 
be exploited directly by the opponent) as much as in two-player games. Using only 
the reduced top 50 entails that the NP-complete subgraph isomorphism problem, 
which determines whether the corresponding rule can be applied to a molecule and 
yields the next molecule(s), needs only to be solved for at most 50 rules, instead of 
for all rules in the transformation rule set. 
Evaluation by rollout. Before starting the rollout, the state is first checked for 
being terminal. A state can be terminal if it is solved. States within the tree that are 
already solved are called ‘proved’”**. States can also be terminal if no legal actions 
are available in that state. Terminal states are directly evaluated with the reward 
function. If the state is non-terminal, a rollout is started. During rollout, actions 
are sampled recursively for each molecule in the state from the top 10 actions of 
the rollout policy until it has been deconstructed into building blocks or a maximal 
recursive function call depth of d, is exceeded. For the sake of simplicity, rollouts 
that completely solve the molecule are currently not stored. 

The reward function r(s) returns z> 1 if the state is proved to encourage 

exploitation, a reward € [0,1] depending on the ratio of molecules solved during 
rollout, and —1 if the state is terminal and unproven, or unsolved during rollout. 
Learned value functions are a possible, future alternative to rollouts. Investigations 
to learn value functions are currently ongoing in our laboratory. 
Update. In the update phase, the action values Q(s, a) and visit counts N(s, a) of 
the edges traversed in the branch from s, to the root node are updated. The edges 
gather the mean action value as in equation (2), where the indicator function I;(s, a) 
is 1 if the edge was played during the ith simulation and z; is the reward received 
during rollout. 


Qs,a)=—* — 7 is, a)ziw(bi) (2) 
i=l 


Ns, a) = 


Here, it is also possible to inject custom objective functions W(b;) > R that might 
assign higher rewards, for example, to shorter, convergent, atom-economic or 
confident branches b;. In this work, we adjust the reward by using 


J 
&(bi) = length(b;) — S> kP(s;, aj) (3) 


j=l 


(4) 


W!max(b;) _ max 41 a 


max 


where J=length(-) denotes the length of a branch, P(sj, aj) is the probability of the 
jth action in the branch obtained from the expansion policy, k=0.99 is a damp- 
ing factor, and the maximal branch length Lmax= 25. Here, inclusion of the prior 
policy P(s;, aj) allows us to bias the reward also towards more confident branches. 
An interesting scoring function to investigate further is similarity to reported 
reactions”. 

After either the time or the iteration step budget has been exhausted, the 
synthesis plan is selected, starting at the root node, by greedily choosing the 
action with the highest action value until a terminal solved state is reached, or 
a maximum depth has been exceeded, in which case the problem is unsolved. 
A maximal rollout depth of 5, an exploration constant of 3 and a reward, when 
proved, of 10 were employed as the MCTS parameters in the quantitative and 
qualitative experiments. 
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Automatic transformation rule extraction. Formalizing chemical knowledge by 
hand has been attempted. Even though it sounds simple to write down the rules 
of chemistry, it takes years to formalize only humble knowledge bases, it is error- 
prone and biased towards the knowledge of the encoding experts, and in many 
cases chemical systems are too complex or just not well enough understood to 
formally write down their limitations and scope. Like rule-based common-sense 
reasoning”, this approach is considered to have exhausted its potential!>!718. 
Additionally, given the exponential growth of chemical knowledge (it doubles 
roughly every 15 years), manual encoding is a hopeless endeavour. 

Following our previously reported procedure”’, and building on previous 
work?°-2, transformation rules were therefore extracted automatically. The rules 
are stored using the RDKit reaction SMARTS format™. A very general rule defini- 
tion was employed for the expansion rule set, where only the atoms of the reaction 
centre (including implicit hydrogen atoms and neighbouring-atom count) were 
extracted. The rules in the rollout set contain the reaction centre atoms (with 
implicit hydrogen atoms and neighbouring-atom count) and additionally the 
directly neighbouring atoms of the atoms in the reaction centre with their implicit 
hydrogen atom count. Rules were extracted only from single-step reactions with 
one, two or three reactants and a single product. As this work is a proof-of-concept 
study with the intent to radically avoid expert encoding and curation, we chose not 
to exclude reactions based on low yield or extreme reaction conditions, as these are 
quite subjective criteria. For example, a yield of just a few per cent can be sufficient 
if the aim is to obtain only a few milligrams of a compound for biological testing, 
while 90% yield is clearly unsatisfying if quantitative alternatives are available. In 
the future, more sophisticated approaches based on reaction classification could 
be employed to extract rules, for example, by grouping together similar leaving 
groups into a single rule*"*"*, Also, further investigation of directly translating 
from products to reactants using neural networks is called for®. 

The general advantages and limitations of automatic rule extraction have been 
discussed in detail elsewhere’? !>°!3. Its main disadvantages (defining the scope 
of reactions and competing reactivity, incorporating mechanistically needed/acti- 
vating functional groups, and deciding which rules to apply first) can be addressed 
by learning supervised policies to predict which rules to apply”’. 

The use of symbolic rules has the great advantage that it is deeply rooted in 
chemists’ language. This makes it easy for the model to communicate its results to 
the human user. Furthermore, because the transformations were extracted from 
the literature, we can link back directly to literature precedents, which is of crucial 
importance for chemists. 

We note that even when taking all pre-2015 rules without count restriction 
into account, only 82% of the reactions published in and after 2015 are covered. 
The missing 18% are novel reaction types. This highlights the success of chemists 
inventing novel methodologies, but also implies that eventually a retrosynthesis 
system should also be able to discover novel reactions on its own”>. 

Policy networks. The neural policy networks were trained by minimizing the neg- 
ative log-likelihood of selecting the transformation a that was used in the literature 
to make molecule m. This is essentially supervised multi-class classification. To 
evaluate the accuracy, reactions that are not covered in the rule set are excluded. 
Training was carried out using stochastic gradient descent (ADAM optimizer™) 
within 1-2 days on a single NVIDIA K80 graphics processing unit. The Keras 
neural network framework was employed, using Theano as the backend®™®, 
Expansion policy network. Molecules are represented by real vectors in the form 
of counted extended-connectivity fingerprints (ECFP4)®, which are first modulo- 
folded to 1,000,000 dimensions, and then In(x + 1)-preprocessed. After that, 
a variance threshold is applied to remove rare features, leaving 32,681 dimensions. 
For the machine-learning model, we used a 1+-5-layer highway network with 
exponential linear unit (ELU) nonlinearities*”**. A dropout ratio of 0.3 was applied 
after the first affine transformation to 512 dimensions, and a dropout ratio of 
0.1 was applied after each of the five highway layers. The last layer of the neural 
networks is a softmax, which outputs a probability distribution over all actions 
(transformations) p(a|m), which forms the policy (see Extended Data Fig. 4). 
Rollout policy network. For rollout, molecules are represented by counted ECFP4 
fingerprints’, modulo-folded to 8,192 dimensions, and then In(x + 1)-preprocessed. 
As the rollout policy, a neural network with a single hidden dense layer with a 
dimensionality of 512, and ELU nonlinearity and dropout of 0.4 was used. 
In-scope filter. The function of the in-scope classifier is to rapidly filter out the 
nonsensical reactions that plague rule-based systems, such as incorrect regioiso- 
mers in electrophilic aromatic substitutions. For this purpose, we chose a binary 
classifier, which is fast to evaluate. The investigation of more sophisticated, but 
slower, reaction prediction approaches is left to future work!8734148-79. For the 
same reason, only the product and the reaction fingerprint serve as inputs to the 
classifier, although the exclusion of conditions makes the classifier underdeter- 
mined*!”. Reactions can selectively lead to different products under different 


conditions. However, the inclusion of reaction conditions as another input feature 
would require additional search in condition space at each step, which is not feasi- 
ble given the time constraints we imposed here. In tasks where search time is not 
a constraint, such as in process development, reaction condition prediction could 
be performed at each step of the search, or be performed in a second sweep. One 
way to predict reaction conditions might be via reaction similarity search, which 
is related to how chemists use reaction databases. 

Our classifier is a neural network with two branches (see Extended Data 
Fig. 4). The first branch embeds the reaction 1;, represented as a counted ECFP4 
reaction fingerprint*°*7!-” p; modulo-folded to 2,048 dimensions, via a single 
dense ELU layer. The second branch embeds the product, represented as a counted 
ECFP4 fingerprint y;, modulo-folded to 16,384 dimensions and In(x + 1)- 
preprocessed, through a 5-layer ELU-highway network. The cosine proximity of 
these embeddings is then fed into a sigmoid unit to predict the probability that the 
reaction gives rise to the expected product. 

We used two strategies to obtain negative data, as follows. First, 30 million incor- 
rect reactions were obtained by the application of reaction rules to the reactants 
of reported reactions, using the same rule set as for the expansion policy*!””*. 
Here we make the assumption that the reactants can only react in the reported way. 
Any product generated by rule application not matching the reported product is 
considered to be a failed product. With this approach, for example, wrong regi- 
oisomers can be generated. We note that these ‘negative’ reactions generated in 
this way capture the cases where a naive, contextless rule-based system would fail. 
Furthermore, 70 million negative training data points were generated by perturb- 
ing tuples (p;, @;) to (p;, oj), where ij, by random sampling. Random sampling 
gives a small performance boost of 0.0025 score points in the area under the 
receiver operation characteristic curve (AUROC) and 0.0072 points in the area 
under the precision-recall curve. Training data were generated only from reac- 
tions published before 2015, while test data were generated from data published 
in or after 2015. The classifier was trained by minimizing negative log-likelihood 
using the ADAM optimizer. Extended Data Fig. 1 shows the receiver operation 
characteristic curve of the classifier. A value of 0.9 was selected as the decision 
threshold for our classifier. Here, the classifier has a false negative rate of 14% and 
a false positive rate of 1.5% 

Given that the in-scope filter learns to embed molecules in a vector space close 
to the reactions that were used to make them, it would be interesting to investigate if 
this can be directly used for nearest-neighbour search of molecules in reaction-rule 
space, which can also be described as label embedding. 

Rediscovering electronic properties. To study what the in-scope filter has learned, 
we conducted two experiments. First, we studied Diels-Alder reactions of cyclo- 
pentadiene with various dienophiles. The reactions were submitted to the in-scope 
filter, and the raw logit scores (the output of the neural network before applying 
the final sigmoid function) were calculated. As a comparison, the structures of 
the dienophiles were optimized and the energies of the LUMO were calculated via 
density functional theory (BP86-D3BJ/def2-SVP) to capture the qualitative trend, 
using the ORCA3 software’. Extended Data Fig. 5a shows the correlation of the 
LUMO energy with the logit score, which has an r? = 0.74. Additionally, para- 
brominations via electrophilic aromatic substitution were studied (Extended Data 
Fig. 5b). Here, the logit scores were correlated to Hammond electronic parameters, 
and an r =0.78 was found. This indicates that the in-scope filter correlates with 
basic electronic properties, following the expected behaviour of the respective reac- 
tion mechanisms. This is remarkable, as its input features (ECFP4-based molecular 
and reaction fingerprints) do not contain electronic information. 

Performance evaluation studies. Baselines. In BFS, each branch is added to a 
priority queue, which is sorted by cost. In heuristic BFS, this cost function is the 
SMILES*” heuristic, which is used as reported’. In neural BES, the cost is calcu- 
lated as f(b) = se oh (1 — P(a;|s;)), where P(a;{s;) is the probability of that trans- 
formation calculated by the expansion policy. Evaluation studies were performed 
using the central processing unit of a 24-core commodity cluster node using a 
single search thread. To provide a more meaningful comparison no graphical pro- 
cessing unit was used for the evaluation studies. 

Building block and test molecule selection. The building blocks have to be selected 
before the search is started, and could be molecules on stock in the laboratory, 
known in the literature, or commercially available chemicals. Here we use a set of 
423,731 molecules, containing 84,253 building blocks from three major chemical 
suppliers (SigmaAldrich, AlfaAesar and Acros), obtained from the ZINC database 
(http://zinc15.docking.org/) and 339,478 molecules from the Reaxys database, 
which have been used as reactants at least five times before 2015. To obtain a set of 
target molecules for the quantitative evaluation that contains different scaffolds, 
first all molecules reported after 2014 were clustered using the ECFP6-based Butina 
algorithm”®, Then, 497 target molecules were randomly selected from amongst the 
82,673 different cluster cores. 
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AB testing. The participants in the AB tests were 45 postgraduate students who 
had specialized in organic chemistry at the Institute of Organic Chemistry at 
Westfalische Wilhelms-Universitat Miinster and Shanghai Institute of Organic 
Chemistry. The study was conducted in a double-blind setup. During the test, 
neither the participants nor the conductors were aware of the origin of the route. 
Statistical significance was tested via the Wilcoxon signed-rank test. 

3N-MCTS versus literature. In the comparison of 3N-MCTS with the literature, 
the expectation would be that experts prefer the literature option. In 128 AB tests, 
the experts preferred the literature route in 43.0% and MCTS in 57.0% (Wilcoxon 
signed-rank test on paired data, P=0.26). The null hypothesis that both datasets 
stem from the same source cannot be rejected. 

3N-MCTS versus heuristic BFS. Here, 68.2% of the participants preferred 3N-MCTS 
generated solutions, whereas only 31.8% preferred heuristic-BFS-generated 
solutions in 129 submitted tests. The experts strongly favour MCTS, the null 
hypothesis of indistinguishable sources (50% preference for each) can clearly be 
rejected (Wilcoxon signed-rank test on paired data, P=0.01277). 

Data availability. The reaction dataset used in this study is provided by Elsevier 
Information Systems GmbH under licence. 
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Extended Data Figure 1 | Receiver operation characteristic curve for the in-scope filter. The area under the curve is 0.99. 
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Extended Data Figure 2 | An exemplary 10-step synthesis route for a complex intermediate in a drug synthesis. It resembles the published route”” 
(with intermediates A and B) and was found by our algorithm autonomously within 30s. The target was not contained in the training set. 
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Extended Data Figure 3 | Example of reaction used in the AB testing, where the MCTS-derived route was less favoured. In this task, the participants 
preferred the literature solution, as its key step was presumably perceived to be more convergent. 
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Extended Data Figure 4 | Architectures of the employed neural networks. (‘dim, dimensions.) 
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Extended Data Figure 5 | Rediscovering physicochemical properties (r? = 0.74) (a) and with empirically measured Hammond parameters in 
with the in-scope filter. The output logit score of the neural network electrophilic brominations (7? = 0.78) (b), even though the input features 
correlates surprisingly well with calculated quantum-mechanical (ECFP4 fingerprints) do not contain electronic information. 

properties (LUMO energies, in Hartree) in Diels-Alder reactions 
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Extended Data Table 1 | Metrics for the supervised neural network policies 


Policy # rules Coverage Matching rules/mol® Accuracy? topl0Acc? _ top50Acc# 
Expansion 301,671 0.79 46,175 0.310 0.633 0.725 
Rollout 17,134 0.52 321 0.501 0.891 0.964 


Top10Acc/top50Acc is the ratio of correct/incorrect predictions if we allow the system to make 10 or 50 predictions. 


*Accuracy is calculated on the molecules covered by the respective rulebase. 
’Matching rules/mol corresponds to the branching factor. 
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De novo mutations in regulatory elements 
in neurodevelopmental disorders 


Patrick J. Short!, Jeremy F. McRae!, Giuseppe Gallone!, Alejandro Sifrim!, Hyejung Won’, Daniel H. Geschwind**, 
Caroline F. Wright!®, Helen V. Firth!®, David R. FitzPatrick’, Jeffrey C. Barrett! & Matthew E. Hurles! 


We previously estimated that 42° of patients with severe developmental disorders carry pathogenic de novo mutations 
in coding sequences. The role of de novo mutations in regulatory elements affecting genes associated with developmental 
disorders, or other genes, has been essentially unexplored. We identified de novo mutations in three classes of putative 
regulatory elements in almost 8,000 patients with developmental disorders. Here we show that de novo mutations in highly 
evolutionarily conserved fetal brain-active elements are significantly and specifically enriched in neurodevelopmental 
disorders. We identified a significant twofold enrichment of recurrently mutated elements. We estimate that, genome- 
wide, 1-3% of patients without a diagnostic coding variant carry pathogenic de novo mutations in fetal brain-active 
regulatory elements and that only 0.15% of all possible mutations within highly conserved fetal brain-active elements 
cause neurodevelopmental disorders with a dominant mechanism. Our findings represent a robust estimate of the 
contribution of de novo mutations in regulatory elements to this genetically heterogeneous set of disorders, and emphasize 
the importance of combining functional and evolutionary evidence to identify regulatory causes of genetic disorders. 


The importance of non-coding variation in complex disease has been 
well established—most disease-associated common SNPs lie in inter- 
genic or intronic regions, albeit with low effect sizes”. Rare sequence 
and structural variants in relatively few regulatory elements have been 
causally linked to Mendelian disorders*°. These pathogenic regulatory 
variants can act by loss of function®”® or gain of function'®"' and most 
act dominantly, with a few exceptions’”. These regulatory elements can 
lie far from the gene they regulate. For example, sequence variants in 
an evolutionarily conserved regulatory element located 1 Mb from its 
target gene, SHH, can cause polydactyly’®. As a consequence, it can be 
challenging to identify the gene whose regulation is being perturbed 
by an associated regulatory variant!?!°. Moreover, the contribution 
of highly penetrant mutations in regulatory elements to genetically 
heterogeneous rare diseases, such as neurodevelopmental disorders, 
has not been firmly established. 

We recruited 7,930 individuals with a severe, undiagnosed develop- 
mental disorder, and their parents to the Deciphering Developmental 
Disorders (DDD) study from clinical genetics centres in the UK and 
Ireland. Systematic clinical phenotyping’* identified 79% with cog- 
nitive impairment or abnormality of the brain, which we refer to as 
neurodevelopmental disorders. Congenital heart defects (CHD) were 
the most prevalent non-neurodevelopmental phenotype, present in 
10% of the cohort. Exome sequencing of the first 4,293 families in a 
previous analysis revealed that about 25% of probands carry damaging 
de novo mutations (DNMs) in genes associated with developmental 
disorders, accounting for the majority of diagnostic variants!”'*. An 
additional 17% of probands carry pathogenic DNMs in genes not yet 
robustly associated with developmental disorders'®. Thus the majority 
of the probands do not carry a diagnostic variant in a protein-coding 
gene, and are termed ‘exome-negative. To explore the role of DNMs 
in non-coding elements, we performed targeted sequencing on three 
classes of putative regulatory elements: 4,307 highly evolutionarily 


conserved non-coding elements (CNEs)!’, 595 experimentally 
validated enhancers”°, and 1,237 putative heart enhancers”!, together 
covering 4.2 Mb of sequence with comparable depth of coverage to 
protein-coding regions (Extended Data Fig. 1, Supplementary Table 1). 
Furthermore, we define a set of ‘control’ intronic elements covering 
6.03 Mb (see Methods). 


Selective constraint acting on non-coding elements 

We first assessed how much purifying selection had skewed allele 
frequencies in non-coding elements. We used the mutability- 
adjusted proportion of singletons (MAPS) metric” in 7,080 unrelated, 
unaffected DDD parents to test six different element classes: introns, 
heart enhancers, validated enhancers, CNEs, protein-coding genes, 
and genes known to be associated with developmental disorders. The 
validated enhancers from the VISTA enhancer browser vary across the 
spectrum of evolutionary conservation, while the heart enhancers are 
poorly conserved, consistent with previous reports”, and the CNEs 
show high levels of evolutionary conservation (Fig. 1a). The introns 
and heart enhancers show little evidence of purifying selection, while 
the experimentally validated enhancers and CNEs are constrained to 
a similar degree to protein-coding genes, but less than genes known to 
be associated with developmental disorders (Fig. 1b), consistent with 
evolutionary conservation maintained by purifying selection. Statistical 
power to detect functionally relevant variants in protein-coding genes 
is strengthened considerably by stratification of variants by their likely 
impact on the encoded protein and variant deleteriousness metrics 
such as CADD™*. We computed the MAPS within bins of CADD scores 
encompassing 1,520,250 variants in unaffected DDD parents to assess 
whether CADD was predictive of selective constraint. In protein- 
coding genes, the strong correlation between CADD score and 
strength of purifying selection enabled us to differentiate between 
variants that are neutral, weakly constrained, and highly constrained. 


1Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge CB10 1SA, UK. @Department of Neurology, David Geffen School of Medicine, University of California Los 
Angeles, Los Angeles, California 90095, USA. 3Center for Autism Research and Treatment, Program in Neurobehavioral Genetics, Semel Institute, David Geffen School of Medicine, University of 
California Los Angeles, Los Angeles, California 90095, USA. “Department of Human Genetics, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, California 90095, 
USA. ‘Institute of Biomedical and Clinical Science, University of Exeter Medical School, RILD Level 4, Royal Devon & Exeter Hospital, Barrack Road, Exeter EX2 5DW, UK. °East Anglian Medical 
Genetics Service, Box 134, Cambridge University Hospitals NHS Foundation Trust, Cambridge Biomedical Campus, Cambridge CB2 0QQ, UK. 7MRC Human Genetics Unit, MRC IGMM, University 


of Edinburgh, Western General Hospital, Edinburgh EH4 2XU, UK. 
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Figure 1 | Selective constraint in targeted non-coding elements. 

a, Evolutionary conservation score (phastcons10019) for CNEs 

(n= 4,307), experimentally validated enhancers (VISTA; n= 595), and 
putative heart enhancers (m= 1,237). b, Strength of selection (MAPS 
metric, mean and 95% CI represented by dot and bars) in targeted non- 
coding elements compared to protein-coding regions, where “Exonic’ 
refers to all variation within protein coding-exons. Stratification based on 
synonymous/non-synonymous consequence displayed on the same row to 
illustrate power of even a simple discriminator. Introns and putative heart 
enhancers show little evidence of purifying selection while CNEs show 


In CNEs, CADD differentiates neutral variation from variation under 
weak constraint, but failed to identify highly deleterious variants with 
selective constraint on a par with protein-truncating variants (Fig. Ic, 
Extended Data Fig. 2d). Other deleteriousness metrics were assessed, 
but none were more informative than CADD (Extended Data Fig. 2a-c). 

We used DNase I hypersensitivity sites (DHS) in 39 tissues and 
chromHMM genome segmentation predictions in 111 tissues** to 
predict tissue activity for the targeted non-coding elements. Of the 
4,307 CNEs we sequenced, 4,046 (93.9%) were active in at least one of 
the 111 surveyed tissues whereas 261 (6.1%) were inactive or repressed 
in all tissues (Extended Data Fig. 2e, f). Variants within a DHS peak 
in at least one tissue were under stronger purifying selection than 
variants that did not overlap a DHS peak (P=0.019), but we did not 
identify significant differences in selective constraint between tissues 


(Fig. 1d). 


Enrichment of mutations in non-coding elements 

We identified candidate de novo single nucleotide mutations in 7,930 
trios (see Methods). We adapted a previously described model for 
germline mutation”® to include methylation status at CpG sites (see 
Methods, Extended Data Fig. 3a) and show that it better accounts for 
observed levels of rare variation than the unadapted model (Extended 
Data Fig. 3b). We tested four genomic features previously associ- 
ated with mutagenicity’ for enrichment in non-coding elements 
with DNMs and found no evidence that these genomic features were 
enriched in non-coding elements with DNMs (H3K27me3, \?- 
test P= 0.4809; H3K9me3, ?-test P=0.1966; replication timing”®, 
Extended Data Fig. 3f; recombination rate’, Extended Data Fig. 3e). 
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selection on par with all genes, but less than genes known to be associated 
with developmental disorders. c, Using CADD to stratify coding and 
non-coding variants observed in unaffected parents differentiates neutral 
variation from weakly and strongly constrained sites in coding regions, 
but fails to identify non-coding variation with selection pressure on par 
with protein-truncating variants (stop gained). d, Sites overlapping a DHS 
in at least one tissue are under stronger purifying selection than sites not 
overlapping a DHS. ES cells, embryonic stem cells; HSCs, haematopoietic 
stem cells; iPS cells, induced pluripotent stem cells. 


We identified 1,691 ‘exome-positive’ individuals with a likely patho- 
genic protein-altering DNM or inherited variant in a gene known to 
be associated with developmental disorders, with the remaining 6,239 
being ‘exome-negative. Using the mutation model, we compared the 
numbers of observed and expected DNMs in the targeted non-coding 
elements in these individuals. No significant DNM enrichment was 
observed in exome-positive probands in the targeted non-coding 
elements, demonstrating that the mutation model is reasonably 
well-calibrated and that a large proportion of exome-positive cases are 
likely to represent Mendelian syndromes caused by high-penetrance 
protein-coding mutations (Extended Data Fig. 4a). We note that 
the number of exome-positive individuals affords only limited power 
to reject modest mutation enrichment in the non-coding elements. On 
the basis of these results, we chose to focus on the 6,239 exome-negative 
individuals for subsequent analyses. 

We found that the CNEs were nominally significantly enriched for 
DNMs (422 observed, 388 expected, P= 0.04), whereas experimen- 
tally validated enhancers (153 observed, 156 expected, P=0.605), heart 
enhancers (86 observed, 86 expected, P=0.514), and intronic controls 
(901 observed, 919 expected, P=0.728) were not enriched (Fig. 2). 

Given the preponderance of individuals with neurodevelopmen- 
tal disorders in our cohort but the broad range of tissue activity of 
the targeted CNEs, we focused on CNEs that are active in the fetal 
brain. DNMs were strongly and significantly enriched within 
2,077 fetal brain DHS peaks in CNEs (177 observed, 138 expected, 
P=8.1 x 10~“*) but no enrichment in sites in CNEs falling outside 
fetal brain DHSs (245 observed, 249 expected, P= 0.608) (Fig. 2). 
We also used chromHMM” predictions of fetal brain activity and 
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Figure 2 | Enrichment of DNMs across element classes and functional 
annotations in exome-negative probands. n = 6,239. Red diamonds 
indicate observed counts, while black circles and bars indicate expected 
count and 95% CI, respectively. Targeted CNEs showed a modest 
enrichment for DNMs (422 observed, 388 expected, P= 0.04) while 
heart enhancers, experimentally validated enhancers, and control 
introns matched the null model. Observed enrichment is specific to 
CNEs predicted to be active in the fetal brain and to patients with 
neurodevelopmental disorders (238 observed, 194 expected, 

P=1.2 x 10-3). Confidence intervals and P values derived from a 
Poisson distribution. 


again identified significant enrichment of DNMs in the 2,613 fetal 
brain-active CNEs (Fig. 2). Moreover, the DNMs observed in fetal 
brain-active CNEs in exome-negative probands were at more highly 
conserved sites (Wilcoxon rank sum test on PhyloP 100-way score?’) 
compared to DNMs observed in exome-positive probands (Extended 
Data Fig. 4b). To test for as yet unknown factors causing differential 
mutability, we compared the levels of rare variation in fetal brain-active 
and -inactive CNEs in 7,509 deep whole genomes from the gnomAD 
consortium and found no evidence for a higher germline mutation rate 
in fetal brain-active elements (Extended Data Fig. 3c, d). The excess of 
DNMs observed in fetal brain-active CNEs is concentrated exclusively 
within the 79% of exome-negative probands with neurodevelopmental 
phenotypes (fetal brain DHS peaks: 147 observed, 109 expected, 
P=3.1 x 10~4; fetal brain-active by chromHMM: 238 observed, 194 
expected, P=1.2 x 1073), with no significant enrichment observed 
in those without neurodevelopmental phenotypes (fetal brain DHS: 
P=0.413; fetal brain-active by chromHMM: P=0.681) (Fig. 2). The 
highly significant and specific enrichment of DNMs in fetal brain- 
active CNEs in exome-negative probands with neurodevelopmental 
disorders is robust to Bonferroni correction for thirteen explicitly and 
implicitly tested hypotheses (see Methods, Extended Data Fig. 5a). 
Analysis of the FANTOM5* and EnhancerAtlas*? datasets suggests that 
50-70% of the fetal brain-active CNEs act as enhancers (see Methods). 

We re-evaluated the experimentally validated enhancers with func- 
tional evidence for activity in fetal brain (N= 383, 64%) and observed a 
nominally significant enrichment for DNMs only within the top quar- 
tile of evolutionary conservation (18 observed, 9 expected, P=0.01) 
(Extended Data Fig. 5b). This result suggests that even for experimen- 
tally validated fetal brain enhancers, DNM enrichment is concentrated 
within elements with strong evolutionary conservation. 

We assessed four methods of gene target prediction: Genomicus 
(based on evolutionary synteny), correlation between DNase accessibility 
and gene expression, Hi-C in fetal brain'> and choosing the closest 
gene. Genome annotations are rapidly evolving and the sensitivity and 
specificity of gene target prediction methods is not yet known. However, 
independent expression quantitative trait loci, enhancer RNA and Hi-C 
data all suggest that the closest gene is often not the target of non-coding 
regulatory variation? **, 

Across the four methods tested, the proportion of fetal brain-active 
CNEs for which a target gene was predicted was 28% (fetal brain Hi-C), 
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48% (DHS-RNA correlation), 91% (evolutionary synteny), and 100% 
(closest gene). The pairwise concordance between any two methods 
(given that both methods make a prediction) was between 17% and 35% 
(Extended Data Fig. 6a). Intersecting multiple independent methods 
may provide higher confidence predictions, but comes at a cost of sen- 
sitivity and therefore power. We did not identify any enrichment for 
DNMs in elements predicted to target genes known to be associated with 
developmental disorders, likely dosage-sensitive genes (pLI metric”’), 
or genes that are differentially expressed in the brain (see Methods, 
Extended Data Fig. 6b for Hi-C results). Elements with DNMs were 
enriched for interactions with genes that are specifically upregulated in 
early prenatal brain development*® (Extended Data Fig. 6c, Methods). 

We assessed the impact of DNMs on a set of 45 transcription 
factor binding motifs that are enriched in fetal brain-active CNEs (see 
Methods), and observed a nominally significant enrichment for DNMs 
predicted to increase binding affinity; this did not survive multiple 
hypothesis correction (Extended Data Fig. 7a—d). Given the number 
of DNMs we have identified, and the relative immaturity of in silico 
predictions of the impact of non-coding variation, it is not currently 
possible to determine precise mechanisms by which these DNMs con- 
tribute to developmental disorders. 

To explore the penetrance associated with the observed DNM enrich- 
ment in the targeted non-coding elements, we investigated potential 
overtransmission of inherited rare variants in these elements to affected 
children and found no evidence for overtransmission (Extended Data 
Fig. 7e). Furthermore, we did not detect any enrichment for rare 
variants in cis that would suggest that the DNM is acting as a ‘second 
hit’ to an already perturbed haplotype. The fold-enrichment of DNMs is 
consistent with DNMs in fetal brain-active CNEs comprising a mixture 
of 70-80% non-pathogenic DNMs and 20-30% pathogenic DNMs. 


Recurrently mutated regulatory elements 

We found a significant excess of recurrently mutated elements (two or 
more DNMs in unrelated individuals) in the fetal brain-active CNEs 
and evolutionarily conserved enhancers compared to the expec- 
tation under the null mutation model (31 observed, 15 expected, 
P=9.3 x 107) (Fig. 3a). However, no individual element exceeded a 
conservative genome-wide significance threshold of P< 1.91 x 10° 
(Bonferroni correction for independent tests on 2,613 fetal brain-active 
elements) (Fig. 3b). 

Increased power to detect locus-specific enrichments of DNMs could 
be gained from aggregating DNMs across elements that regulate the same 
target gene. However, as described above, gene target prediction lacks 
coverage and accuracy. CNEs have been shown to cluster together within 
the genome and are enriched around developmentally important genes”. 
Therefore we applied hierarchical clustering on the 2,613 fetal brain- 
active CNEs to identify 356 clusters (see Methods). We found an excess 
of recurrently mutated clusters, defined as two or more elements with at 
least one DNM in each element (11 observed, 6 expected, P=0.016). We 
did not find any element clusters with a significant excess of DNMs at a 
genome-wide significance threshold (Supplementary Table 2). 

We used chromHMM” to assign the recurrently mutated CNEs to 
a predicted chromatin state. We observed the greatest excess of DNMs 
in CNEs predicted to be enhancers (n = 9) or strongly or weakly tran- 
scribed (m= 8) (Extended Data Fig. 8). Five of the eight transcribed 
recurrently mutated elements fall in close proximity to exons, but are 
not in protein-coding transcripts and show evidence of involvement 
in alternative splicing (BCLAF1, SRRT, SLC10A7, and MKNK1) or asa 
3’ UTR (CELF1). The full set of recurrently mutated elements is 
described in Supplementary Table 3 and the location of DNMs 
relative to population variation and additional annotations is shown 
in Extended Data Fig. 9. 


Estimating genome-wide non-coding mutation burden 
The absence of individual non-coding elements with genome-wide sig- 


nificant enrichment of DNMs allowed us to place an upper bound on 
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Figure 3 | Recurrently mutated elements. a, Approximately twofold 
enrichment of recurrently mutated non-coding elements. Grey histogram 
shows distribution of expected number of recurrently mutated fetal 
brain-active non-coding elements under the null model and vertical 

line indicates observed number. b, Enrichment test of individual non- 
coding elements. No element was significant at a genome-wide threshold 
of P< 1.9 x 107° (Bonferroni correction for testing 2,613 fetal brain- 


the proportion of sites and elements in which DNMs are pathogenic. 
Approximately 8% of DNMs in protein-coding regions result in a 
protein-truncating mutation’®*”. CNEs are smaller than protein-coding 
exons (median 600 bp) and also lack annotation to identify putative 
pathogenic mutations. Down-sampling gene length to 600 bp and 
masking protein consequence annotation resulted in an 80% drop in 
empirical power for the 94 genes passing the genome-wide significance 
threshold in a previous study'* (Extended Data Fig. 10a). As we did not 
discover any genome-wide significant CNEs, the proportion of DNMs 
in CNEs that are pathogenic and highly penetrant must be substantially 
lower than 8%. We modelled the likelihood of observing 286 DNMs, 
25 recurrently mutated CNEs, and zero CNEs at genome-wide signifi- 
cance across different values for the number of fetal brain-active CNEs 
(out of 2,613) and the proportion of mutations in those elements that 
are pathogenic with a dominant mechanism for neurodevelopmental 
disorders (see Methods). The maximum likelihood model is one in 
which 3.5% of mutations within approximately 100 elements are patho- 
genic with a dominant mechanism. However, there is considerable 
uncertainty around this point estimate (Extended Data Fig. 10b), with 
the credible interval including scenarios in which tens of elements have 
around 5-7% of mutations being pathogenic or thousands of elements 
have below 1% of mutations being pathogenic. 

Our survey of the non-coding genome is biased towards highly evo- 
lutionarily conserved elements, but also includes elements with lower 
levels of evolutionary conservation. To extrapolate the excess of DNMs 
we observed in the targeted non-coding elements to a genome-wide 
estimate, we modelled the enrichment of DNMs as a function of evo- 
lutionary conservation (see Methods). Factoring in the distribution of 
evolutionary conservation of fetal brain DHS peaks genome-wide, we 
predicted a genome-wide excess of 88 DNMs (95% confidence interval 
(CI): 48-140), corresponding to 1.0-2.8% of exome-negative cases 
carrying pathogenic mutations in regulatory elements (Fig. 4b) in 
contrast to 13.4% and 28.4% carrying protein-truncating variants and 
missense variants, respectively, estimated previously’® (Fig. 4c). 


Discussion 

We have demonstrated that de novo mutations in regulatory elements 
contribute to severe neurodevelopmental disorders. These elements 
act primarily either as enhancers or to regulate alternative splicing, 
but establishing the precise mechanism for each element has proved 
challenging. This significant excess of DNMs is observed only in highly 
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active elements). Inset plots for three elements show the nearest exon or 
transcription start site, location of DNMs (red markers) with any predicted 
transcription factor binding site disruptions (gain of binding in blue, 

loss of binding in red), location of rare variants in unaffected parents 

(grey markers), evolutionary conservation (blue, higher indicates more 
conserved), and fetal brain DNase I hypersensitivity (male in pink, female 
in blue). TSS, transcription start site. 


evolutionary conserved elements that are active in the fetal brain. These 
elements also exhibit substantial selective constraint within human 
populations. We observed a 1.3-fold excess of DNMs within DHS 
peaks in these regulatory elements, suggesting that a minority of such 
DNMs are pathogenic. Moreover, our modelling suggests that there are 
few, if any, regulatory elements in which more than 4% of mutations 
cause neurodevelopmental disorders with a dominant mechanism. Our 
data are consistent with only 0.15% of mutations within fetal brain- 
active CNEs being highly penetrant for neurodevelopmental disorders 
(Fig. 4a); this is likely to be considerably lower than the proportion of 
dominant pathogenic mutations in protein-coding regions. As a conse- 
quence, this class of pathogenic non-coding DNMs is likely to account 
for only a small proportion (less than 5%) of ‘exome-negative’ indi- 
viduals, and the robust identification of disease-associated regulatory 
elements will present a greater challenge than of protein-coding genes. 

Our study design focuses on highly conserved elements and fetal 
brain-active elements, and is relatively uninformative with respect 
to pathogenic ‘gain-of-function. DNMs within elements that show 
no wild-type activity in fetal brain and are not highly evolutionarily 
conserved. While our findings have focused on the highly conserved 
elements, we do not consider our observations to be definitively 
negative about the role of less highly conserved fetal brain enhancers 
in neurodevelopmental disorders, or the role of heart enhancers in 
CHD (owing to the low proportion of subjects with CHD). The field of 
regulatory element annotation has progressed tremendously over the 
six years since this study design was initially conceived. Therefore, 
a comprehensive analysis of the contribution of variation within all 
classes of non-coding elements to neurodevelopmental disorders is 
likely to require whole genome sequencing (WGS) of many tens of 
thousands, if not hundreds of thousands, of parent-proband trios 
(Extended Data Fig. 10c). 

One challenge of interpreting WGS data is the vast universe of 
hypotheses that could be tested, and thus how to account appropriately 
for multiple hypothesis testing. A recent study reported a nominally 
significant enrichment (P = 0.03) of de novo single-nucleotide vari- 
ants (SNVs) and private copy number variants in fetal brain DHS or at 
sites with PhyloP conservation scores above 4, within 50 kb of known 
autism-associated genes in WGS from 53 individuals with autism*. 
Caution should be exercised in interpreting findings based on small 
sample sizes relative to those required for well-powered analyses (as 
discussed above) and analyses requiring multiple, arbitrary levels of 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


a 

Percentage of all mutations in fetal 
brain-active CNEs with monoallelic 
mechanism for developmental disorders 
— 0.05% 

(20004 0.10% 

8 0.15% 

2 aes 9, 

Ze 0.20% 

wo 

Sc Likelihood ratio 

co 

os 

E38 

ge 

= & 1,0001 

25 

Sec 

oO 

5= 

fe 

og 

. oO 

oO 

a 

£ 

2 

04 
0 0.025 0.050 0.075 0.100 


Proportion of mutations resulting 
In pathogenic loss/gain of function 


sv 
° 


Proportion of DDD probands 


2.66 Mb 
1.15 Mb 


Observed/expected in DDD 


Oo 


0.4 
/ 4 


Bae te 
: : : 1 : 


(0) 02 #04 O06 O08 1.0 
Evolutionary conservation of 
DHS peak (phastcons100) 


SNV 
Variant class 


Figure 4 | Modelling the proportion of DNMs in non-coding elements 
that are likely to be highly penetrant for dominant neurodevelopmental 


disorders. a, Our observation of zero non-coding elements at genome- 
wide significance in 6,239 exome-negative probands indicates that 
very few sites within these elements (<5%) are likely to contribute 

to developmental disorders through a highly penetrant dominant 
mechanism. b, Logistic regression used to model the genome- 

wide contribution of dominant-acting DNMs in fetal brain DNase 
hypersensitive sites in non-coding elements as a function of level of 
evolutionary conservation using a sliding window approach including 
1,000 elements in each bin (see Methods). Dashed lines indicate the 
upper and lower 95% CI. The bar plot shows fetal brain-active DHS 
peaks genome-wide (in megabase of total sequence) at a given level 
of evolutionary conservation. c, The proportion of probands carrying 
a pathogenic de novo SNV ina fetal brain-active regulatory element 


(1-2.8%) is far lower than the proportion carrying a pathogenic protein- 


truncating DNM (~13.4%) or missense DNM (~28.4%). 


variant stratification (for example, gene set, genomic proximity thresh- 
old, and conservation score). WGS-based analyses need to account for 


all explicit and implicit hypothesis testing. 


Our analyses were limited to SNVs as current mutation models for 
indels and structural variation are too inaccurate to allow robust assess- 
ment of mutational excess. In addition, our analyses highlight an urgent 
need for improved tools to stratify benign and damaging variants 
within non-coding elements and to annotate gene targets for regulatory 
elements. These improved mutational models and functionally relevant 
annotations will greatly increase power to detect highly-penetrant dis- 
ease-associated non-coding variation, for example, increasing power 
more than tenfold from 8% to 83% in 40,000 trios (Extended Data 
Fig. 10c). Functional characterization of increasing numbers of robustly 
associated, highly-penetrant, regulatory variants in cellular and animal 
models will be critical in moving from a descriptive to a more predictive 
understanding of non-coding variation in the human genome, as well 


as elucidating its underlying pathophysiological mechanisms. 
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METHODS 


Defining targeted non-coding elements. The placental mammal 28-way phast- 
Cons score!” was used to select the top 4,432 CNEs with no overlap with RefSeq 
genes (downloaded from UCSC on 4 August 2010). Using the VISTA enhancer 
browser”, all 622 putative enhancers with evidence of in vivo activity in developing 
mouse embryos were downloaded on 3 August 2010. At the time the capture was 
designed, it had been observed that heart enhancers are depleted among ultra- 
conserved elements’. As heart defects are the largest group of non-CNS abnor- 
malities in the DDD cohort we sought to supplement the ultra-conserved 
elements with an early annotation of heart enhancers. These putative heart enhancers 
were provided by A. Visel and based on chromatin immunoprecipitation with 
sequencing (ChIP-seq) of p300 in human fetal heart described previously”! in 
GRCh36 coordinates, mapped over to GRCh37. Collectively, these elements cover 
approximately 4.6 Mb of total sequence. First, elements were filtered to exclude any 
targeted sequences with less than 10 x coverage across the DDD data set. Second, 
any elements previously annotated to be non-coding, but classified as protein- 
coding in Gencode v19*’, were removed. Finally, any elements less than 50 bp 
in length were excluded. After filtering, 4,307 conserved elements, 595 enhancer 
elements and 1,237 putative heart enhancers remained. 
Defining intronic control sequences. The exome baits designed to capture the 
coding regions frequently have considerable overlap with non-coding intronic 
regions. To define a set of putative well-covered introns, a 10-bp buffer was added 
upstream and downstream of all gencode v19 coding sequence (to avoid clas- 
sifying any critical splice sites in the control introns) and this coding sequence 
was subtracted from the exome probes. Furthermore, any introns within known 
developmental disorder genes (the DDG2P gene set'”) were excluded. This set of 
control introns was filtered to include only elements 30 bp in length or larger with 
>30x coverage. 
Evolutionary conservation of non-coding elements. The degree of evolutionary 
conservation across vertebrates at the element level was calculated using the phast- 
cons vertebrate 100-way score. Scores were retrieved in R using the Bioconductor® 
package phastCons100way.UCSC.hg19!. 
Benchmarking CADD and other variant scoring methods using MAPS. Scores 
for all possible SNVs genome-wide were downloaded from CADD” (http:// 
cadd.gs.washington.edu/download), Genomiser*! (https://charite.github.io/ 
software-remm-score.html#download), and fathmm-MKL* (https://github.com/ 
HAShihab/fathmm-MKL). 
Functional genomic annotation. Data from DNase hypersensitivity assays (broad- 
Peak set, FDR 1%) were downloaded from the Roadmap Epigenome Project* 
ftp site (http://egg2.wustl.edu/roadmap/data/byFileType/peaks/consolidated/ 
broadPeak/) in order to predict regulatory function and tissue specificity in the 
enhancers and CNEs. The GenomicRanges Bioconductor package was used to 
intersect DHS peaks with the elements sequenced in this analysis. All code used 
in this analysis can be found at https://github.com/pjshort/DDDNonCoding2017. 
Chromatin state predictions (chromHMM 15-state model”) for 111 different 
tissue types were downloaded from the Roadmap Epigenome Project”® (REP) ftp 
site (http://egg2.wustl.edu/roadmap/data/byFileType/chromhmmSegmentations/ 
ChmmModels/coreMarks/indivModels/default_init/). We considered a CNE to 
be inactive in a given tissue if it was completely contained within a chromHMM 
segment described as quiescent, heterochromatin, or polycomb repressed (“9_Het, 
“13_ReprPC; ‘14_ReprPCWk, and ‘15_Quies’) in the 15-state model. Using the 
GenomicRanges** Bioconductor package and coding sequence from gencode v19, 
we calculated the distance of each active and broadly inactive element to the nearest 
exon or transcription start site. All code used in this analysis can be found at https:// 
github.com/pjshort/DDDNonCoding2017. 
Variant calling, QC, and filtering for unaffected parents. Mapping of short-read 
sequences was carried out using the Burrows-Wheeler Aligner (BWA; version 
0.59) algorithm with the GRCh37 1000 Genomes Project phase 2 reference. The 
Genome Analysis Toolkit (GATK; version 3.1.1) and SAMtools (version 0.1.19) 
were used for sample-level BAM improvement. Ensembl Variant Effect Predictor 
(VEP) based on Ensembl gene build 76 was used to annotate variants and, in 
coding regions, the transcript with the most severe consequence was selected. 
We determined the number of variants called per individual, and excluded unaf- 
fected parents with variant counts on the extremes of the distribution (top 1% and 
bottom 1%). We identified a trinucleotide-specific error mode (GTN—GGN) that 
introduced false positives, which was corrected by strict strand filtering (FS <20). 
Across the 7,080 unaffected parents that passed quality control filters, we identified 
1,520,250 unique variants in the targeted non-coding elements and coding regions. 
MAPS metric within functional annotations. We used DHSs from 39 different 
tissues to annotate unique variants from 7,080 unaffected parents in the 4,307 
CNEs and 595 enhancers we targeted in this analysis as ‘in peak or ‘outside peak. 
We calculated the MAPS” for each annotation (within DHS peak in any tissue, 
outside DHS peak in all tissues, and within DHS peak in each specific tissue). 


ARTICLE 


Where possible, we grouped individual tissues into larger tissue groups based on 
the Roadmap Epigenome Project” (REP) categorization. Within the fetal brain 
tissues (E081, E082 in REP), we calculated the MAPS score for each of the 15 states 
in order to assess differences in purifying selection between elements that are likely 
to be inactive versus those that are active. 

Defining exome-positive and exome-negative probands. We used the 
Developmental Disorders Genotype-to-Phenotype Database (DDG2P) to define 
a set of high-confidence developmental disorder genes (https://decipher.sanger. 
ac.uk/ddd#ddgenes). We classified all probands as ‘exome-negative’ if they did 
not have a protein-altering (stop-gain, splice site, or missense variant) DNM in 
a DDG2P gene with a monoallelic loss of function mechanism, a rare inherited 
biallelic variant in a DDG2P gene with a recessive mechanism, or a copy number 
variant identified by clinical microarray and determined to be pathogenic. 

De novo mutation calling. De novo mutations were called as described'®, excluding 
variants with posterior probability <0.00781 as annotated by DeNovoGear™. 
Trinucleotide germline mutation rate model with CpG-methylation status. 
A previously described germline mutation rate model based on trinucleo- 
tide context”® was adapted to include a correction at CpG sites for methylation 
status. This method models the null mutation rate at a given site as a Poisson 
rate parameter that is dependent on the trinucleotide context, where the second 
base is mutated. We fit a linear model to the ratio of observed/expected variants 
at MAF <0.1% in CpG sites based on their methylation status in embryonic stem 
cells. For all CpG sites, we corrected the trinucleotide mutation rate based on 
the methylation status to produce a methylation-aware mutation rate model. As 
the sum of Poisson random variables is Poisson, the rate parameter for a given 
element, or set of elements, can be determined by summing the mutation rate for 
each individual site. Simulated mutations were based on the same trinucleotide 
mutation framework and implemented in an R software package (https://github. 
com/pjshort/DenovoSim). 

Testing for enrichment of mutagenic genomic features. The CNEs sequenced in 
this analysis were intersected with four genomic features previously associated with 
hypermutability: H3K9me3, H3K27me3, replication timing, and recombination 
rate. A x?-test was used to test whether elements in which DNMs were observed 
were enriched for H3K9me3 or H3K27me3 peaks compared to elements in which 
no DNM was observed using primary mononuclear cells from peripheral blood 
(Roadmap Epigenome ID E062). For replication timing”* and recombination rate”, 
a Wilcoxon rank sum test was used to test for differences between the two sets of 
elements. 

Testing for hypermutability using rare variation in deep whole genomes. 
We calculated the number of observed rare variants (MAF <0.1%) per unit of 
expected mutability from the null mutation model for the fetal brain-active and 
brain-inactive elements that we sequenced in 7,509 non-Finnish European deep 
whole genomes present in the gnomAD data set. We used bootstrap re-sampling to 
estimate the standard error around the estimated rare variants per unit mutability 
for each set. To assess the power of this approach to detect mutability, we simulated 
rare variants using the null mutation model under 1.1 x, 1.2, and 1.3 x mutability 
in the fetal brain-active elements and tested the power to reject the null hypothesis 
of mutability 1.0 x for different numbers of elements from 50 to 1,000 in steps of 50. 
Statistical testing for mutational burden. The P value for the number of observed 
de novo mutations compared to expected is calculated in R as: ppois(n_obs - 1, 
lambda = mu, lower.tail = FALSE) where n_obs is the number of observed muta- 
tions within an element and mu is the mutability of the element(s) being tested 
(under the null model described above) multiplied by the number of probands. 
The burden testing we performed across subsets of elements and phenotypes 
included multiple nested hypotheses that were accounted for with a conserva- 
tive Bonferroni-adjusted P value threshold based on the number of explicit and 
implicit tests. We corrected for thirteen tests within the exome-negative cohort 
based on branching on element class and phenotype, where appropriate (detailed 
in Supplementary Fig. 5). In testing for single elements with an excess of observed 
mutations, we employed a conservative Bonferroni adjustment to correct for 2,613 
tests (the number of fetal brain-active CNEs). 

Defining fetal brain-active elements. We used the Roadmap Epigenome Projec 
DNase data and chromHMM annotations to annotate the CNEs as ‘active’ and 
‘inactive’ in the fetal brain. We defined all of the sections of the genome predicted 
to be quiescent, heterochromatin, or polycomb repressed (‘9_Het’ ‘13_ReprPC’, 
‘14_ReprPCWk, and ‘15_Quies’ in the 15-state model) as ‘inactive’ states. We con- 
sidered a CNE or enhancer to be inactive if it was completely contained within 
an inactive chromHMM” segment in both male and female fetal brain and if it 
did not overlap with any high confidence DNase hypersensitive site in male or 
female fetal brain. In total, 2,613 of 4,307 CNEs and 383 of 595 experimentally 
validated enhancers were predicted to be active in the fetal brain based on these 
criteria. All code used in this analysis can be found at https://github.com/pjshort/ 
DDDNonCoding2017. 
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Estimating the proportion of fetal brain-active CNEs acting as enhancers. 
To evaluate the proportion of CNEs that may be acting as enhancers, we ana- 
lysed downloaded enhancer RNA (eRNA) data from the fetal brain generated 
by the FANTOM5S consortium” and predicted fetal brain enhancers from 
EnhancerAtlas*’, which combines multiple sources of data to identify enhancers 
in different tissues. We used the experimentally validated fetal brain-active VISTA 
enhancers to estimate this sensitivity of each data set. We then overlapped the fetal 
brain-active CNEs with the FANTOMS5 eRNA and EnhancerAtlas predictions and 
used the sensitivity estimates to estimate the total proportion of fetal brain CNEs 
likely acting as enhancers. 

Stratifying enhancers by evolutionary conservation. Phastcons vertebrate 
100-way scores’? were retrieved in R using the Bioconductor package phastCon- 
s100way.UCSC.hg19 for each of the 383 fetal brain-active experimentally validated 
enhancers. 

Statistical testing for enrichment of recurrently mutated elements. We 
defined any element observed with a DNM in at least two unrelated probands 
as ‘recurrently mutated’ We used the simulation framework described above 
to calculate the likelihood of observing a given number of recurrently mutated 
elements. To calculate the significance of individual elements, we calculated 
the likelihood of observing n DNMs in a given element with mutability lambda 
in R as ppois(n_obs - 1, lambda = mu, lower.tail = FALSE). The P values were 
compared to a genome-wide significance cutoff of 0.05/2,613 or P< 1.91 x 10° 
(Bonferroni-corrected P value based on independent tests for enrichment across 
2,613 elements). 

Defining CNE clusters. In order to identify clusters of CNEs, we compared the 
inter-element distance in our set of sequenced CNEs to the inter-element distance 
of the same number of elements randomly distributed genome-wide. We used 
agglomerative hierarchical clustering with single linkage clustering in R to define 
clusters at a given inter-element distance. The false discovery rate (FDR) for a set 
of clusters can be determined by comparing the number of observed clusters to the 
number expected under the randomly distributed null model at the same inter- 
element distance. For this analysis, we used a maximum inter-element distance of 
10kb, which corresponds to a false discovery rate of 10%. 

chromHMM state of recurrently mutated elements. We used the chromHMM 
15-state model predictions from the Roadmap Epigenome Project”® (REP) fetal 
brain male and female (E081, E082 in REP) to classify each of the DNMs observed 
in recurrently mutated elements. The predicted state in male/female fetal brain was 
not always concordant. When one annotation predicted the element as inactive and 
the other as active, we kept the active prediction. When the DNM was predicted to 
be active in both male and female fetal brain, but in different states, we chose the 
male state. Re-running this analysis to instead choose the female prediction did 
not substantially change the outcome. 

Phenotypic similarity by human phenotype ontology comparison. Referring 
clinicians used the Human Phenotype Ontology (HPO) version 2013-11-30 to 
systematically describe patients upon recruitment to the DDD study. In order to 
compare phenotypic similarity between groups of patients statistically, the hpo 
similarity test was used’. 

Clustering of DNMs. To test the observed DNMs for clustering that might imply 
disruption of an underlying binding site or functional motif, we used the deno- 
vonear framework described previously'*. This method compares the distance 
between observed DNMs to the distance between simulated DNMs based on the 
trinucleotide null mutation model to generate an empirical P value. 

Gene target prediction and pair-wise overlap. We used four different methods 
of gene target prediction to link CNEs and enhancers to putative target genes. 

The first method, Genomicus, predicts gene targets based on evolutionary con- 
servation with nearby genes. Genomicus determines the extent to which each CNE 
is within the same syntenic block with nearby genes across a number of vertebrate 
species and predicts one or more targets!!. The Genomicus method produces at 
least one prediction for 90% of CNEs (approximately one-third of these are the 
closest genes). 

The second method compares DNase hypersensivity at each CNE to expres- 
sion of nearby genes in 56 different tissues (using RNA sequencing (RNA-seq)) 
to search for CNE-gene pairs that show a correlation between DNase signal and 
gene expression*!. This method produces statistically significant predictions for 
only 28% of CNEs in our set and is likely to be underpowered to detect elements 
that are active in specific tissues or time points. 

The third method is to link CNEs to putative target genes using chromatin 
interaction data (Hi-C) in two different regions of the fetal brain*®. The use of 
Hi-C data is the most direct and tissue-specific of all of the prediction methods 
used, but the prediction is sparse (26% of CNEs with evidence of fetal brain activity 
have a predicted target). 

The fourth method used is a simple heuristic to choose the gene with 
the closest TSS (for intergenic elements) or the gene containing the element 


(for introns). Choosing the closest gene allows us to make a prediction for 100% of 
elements, but comparison with chromatin conformation and DHS-based methods 
has shown that the closest gene is likely to be the target in 7% and 12% of cases, 
respectively***”, 

We used the Genomicus, DHS, and Hi-C predictions to generate aggregated 
predictions which we considered ‘high confidence’ if predicted by at least two of 
the three methods. 

To assess the pair-wise concordance reported in Extended Data Fig. 7, we took 

the set of CNEs for which at least one gene target was reported in both methods 
and tested how frequently both methods identified the same gene within the set 
of predicted targets. 
Brain developmental expression trajectory. BrainSpan developmental RNA- 
seq data (http://www.brainspan.org) were processed as previously described*. 
Expression values were log-transformed (log)[RPKM-+1]) and scale normalized. 
This expression data set consists of six brain regions (cortex, thalamus, striatum, 
hippocampus, amygdala, and cerebellum) and developmental epochs that span 
prenatal (8-37 post-conception week) and postnatal (4 months-40 years) periods. 
Genes that are associated with CNEs with DNMs and without DNMs were selected 
and their developmental expression trajectories were plotted using loess smooth. 
Transcription factor binding analyses. The JASPAR2016 and TFBSTools 
Bioconductor packages“ were used to retrieve position weight matrices for 454 
human transcription factors. Analyses in this paper focus on the 202 transcription 
factors predicted to be expressed in the brain (cortex-expressed from GTEx data 
set’). 

A custom R package called ‘denovoTF’ (https://github.com/pjshort/denovoTF) 
was written to predict any change in transcription factor binding at sites where 
DNMs were observed or simulated. This analysis works by scanning the reference 
and alternative sequences for all 202 PWMs and comparing predicted binding 
events on both sequences. By comparing the potential binding affinity for ref 
and alt sequences, we can predict loss of binding (alt binding < ref binding), gain 
of binding (alt binding > ref binding), and silent (no difference). ‘Silent’ DNMs 
fall into two classes: those for which binding is predicted on both reference and 
alternate, but strength of binding is unchanged, and those which do not lie ina 
predicted transcription factor binding site. 

The analysis of motif enrichment (AME) tool from the meme suite was used 
to identify a subset of PWMs that was significantly enriched in fetal brain-active 
elements”’. Comparing the fetal brain-active CNEs to the fetal brain inactive CNEs 
returned a set of 90 transcription factors, of which 45 were expressed in the brain 
and had PWMs available in JASPAR2016"*. This analysis was performed on the 
meme-suite web server using the following command: 

ame-verbose 1-oc.—control meme_chromHMM_fb_inactive_all.fasta— 
bgformat 1-scoring avg-method ranksum-—pvalue-report-threshold 0.05 meme_ 
chromHMM__fb_active_all.fasta db/JASPAR/JASPAR_CORE_2016.meme 

In order to test for enrichment of loss of binding or gain of binding events in the 
observed DNMs, we compared predicted impact on transcription factor binding 
in observed DNMs to 1,000 simulations of mutations across the 2,613 fetal brain- 
active elements for 6,147 probands. 

Nucleotide-level conservation (PhyloP). PhyloP scores represent the —logio 
P value that a given site is evolving neutrally*. We used a tabix file of pre-computed 
PhyloP vertebrate 100-way scores for every site in the genome in order to annotate 
the DNMs observed in exome-negative probands to exome-positive probands as 
well as the simulated null model. 

Power calculations at different study sizes. We used the trinucleotide null 
model described previously in order to estimate our power to detect disease- 
associated elements. Parameters that affect power include the fold enrichment 
for disease-causing mutations in the DDD cohort (proportional to the incidence 
of severe developmental disorders with a genetic basis in the population), the 
proportion of mutations within a true disease-associated element expected to 
be pathogenic, the penetrance of such mutations, the size and mutability of the 
elements tested, and the number of trios analysed. To estimate the power across 
different study sizes, we fixed the remaining parameters as follows: 120-fold enrich- 
ment for disease-causing mutations, proportion of mutations expected to be patho- 
genic at 8% (lower bound estimate for coding regions), penetrance at 100%, and 
the elements tested were the 2,613 fetal brain-active CNEs. Code for power analysis 
can be found in the R script: https://github.com/pjshort/DDDNonCoding2017/ 
blob/master/analysis_notebooks/Figure4_maximum_likelihood_and_genome 
estimate.Rmd. 

Likelihood of power calculation model parameters under observed data. To 
test the likelihood of different models of dominant disease mechanism within 
the non-coding space we adapted the power calculation framework described 
above to test the probability of observing our data across two different parameters: 
the number of elements (out of 2,613) with a dominant disease mechanism and 
the proportion of mtuations expected to be pathogenic. We tested the likelihood 
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of observing 286 DNMs, 25 recurrently mutated elements, and zero elements 
at genome-wide significance while systematically varying two parameters: the 
proportion of mutations expected to be pathogenic parameter (from 0.01% to 
10.0% in increments of 0.01%) and the proportion of elements with true disease 
associations (from 0 to 2,613 in increments of 5). In this analysis, the remaining 
parameters were held constant: 120 enrichment for pathogenic mutations, 
penetrance at 100%, testing 2,613 fetal brain-active CNEs, and number of trios 
at 6,147. Code can be found in the R notebook: https://github.com/pjshort/ 
DDDNonCoding2017/blob/master/analysis_notebooks/Figure4_maximum_ 
likelihood_and_genome_estimate.Rmd. 

Estimating the genome-wide burden of DNMs in fetal brain-active elements. 
First, we intersected all targeted non-coding sequences, irrespective of original 
class, with fetal brain DHS peaks. We used the phastcons100 score (scores retrieved 
in R using the Bioconductor package phastCons100way.UCSC.hg19"° to rank these 
elements by evolutionary conservation. The ratio of observed/expected DNMs 
was computed with a sliding window across the elements (window size of 1,000 
elements, shift of 100 elements). This approach resulted in a median of 62 DNMs 
expected in each bin (minimum 51, maximum 68) which was compared to the 
observed number of DNMs. We fit a logistic regression to the excess observed/ 
expected in each window, setting any window with observed less than expected to 
have an excess of zero. We used the logistic regression fit on the CNEs sequenced 
in our analysis to predict the burden of DNMs in this genome-wide set. 
Transmission of rare variants. All variants that were heterozygous in one parent 
were tested for any patterns of overtransmission within different variant classes. 
Only elements with >20 median coverage were used for this analysis, as elements 
without adequate coverage showed systematic underestimation of transmission. 
The observed proportion of rare variants that were transmitted from parents 
to affected probands was compared to the expected proportion under the null 
hypothesis (50%) using a binomial test. 

Testing CNEs for ‘already perturbed haplotypes’. In order to test the hypothesis 
that DNMs in fetal brain-active CNEs may be contributing to a developmental 
disorder via a second hit on an already weakened haplotype, we extracted the 
rare variants present in the relevant DNM-containing CNE for each proband. We 
compared the proportion of probands with at least one variant besides the observed 
DN\M in the fetal brain-active CNEs compared to the fetal brain-inactive CNEs. 
We also calculated the total burden of rare variation within the DNM-containing 
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element (measured as SNVs per kb) for probands with DNMs in fetal brain-active 
CNEs compared to probands with DNMs in fetal brain-inactive CNEs. 

Code availability. Source code used to analyse data and generate the figures for 
this article can be found at https://github.com/pjshort/DDDNonCoding2017/. 
Data availability. Sequencing and phenotype data are accessible via the European 
Genome-phenome Archive (EGA) under study number EGAS00001000775 
(https://www.ebi.ac.uk/ega/studies/EGAS00001000775). The DDG2P gene list 
of genes associated with developmental disorders is available at www.ebi.ac.uk/ 
gene2phenotype. All other data are available from the corresponding author upon 
request. 
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Extended Data Figure 1 | Coverage in targeted non-coding elements. Coverage in the targeted non-coding elements is comparable to the protein- 
coding exons (median 73 x and 56x, respectively). 
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Extended Data Figure 2 | Assessment of variant deleteriousness metrics 


and selective pressure in CNEs. Dots and bars represent the point 
estimate and 95% CI, respectively, for MAPS and proportion singletons. 
a, b, Fathmm-MKL (a) and Genomiser (b) separate benign variation (low 
MAPS score) from likely damaging variation (high MAPS score), but do 
not identify any classes of variation under strong selective constraint. 


c, There was no significant difference in the strength of purifying selection 


measured by MAPS between sites predicted to result in loss, gain, or 
no change in transcription factor binding. d, Validation of Fig. 1c using 
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whole-genome data from the UK10K project. While CADD can identify 
coding variation under strong selective constraint (as measured by the 
proportion of singletons), CADD is unable to identify strongly constrained 
non-coding variants. e, f, The subset of CNEs sequenced in the DDD 
cohort that are predicted to be inactive in all 111 Roadmap Tissues 

(n= 261) exhibit a similar degree of evolutionary conservation (e) but 
lower selective constraint (f) in a healthy population compared to CNEs 
active in at least one tissue (n = 4,046). 
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Extended Data Figure 3 | Genomic factors that affect mutation rate in 
non-coding elements. a, Aggregating CpG sites genome-wide into bins 
of methylation proportion from 0% (unmethylated in all cells) to 100% 
(methylated in all cells) and calculating the observed/expected ratio 
reveals differences in mutability not accounted for a by a triplet model 
alone. b, A mutation rate model incorporating a correction for CpG 
methylation explains greater variance in rare variant counts in the DDD 
unaffected parents. c, Levels of rare variation in deep whole genomes 
(n=7,509 non-Finnish Europeans) were used to estimate power to detect 
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a hypermutability of 1.1, 1.2, or 1.3x.d, The level of rare variation in 
the fetal brain-active elements (n= 2,613, FB(+)) is slightly lower than in 
the fetal brain-inactive elements (n = 1694, FB(-)), consistent with similar 
mutability between the two element sets with slightly stronger purifying 
selection in the fetal brain-active elements. e, f, Elements with DNMs 
observed in our study are not enriched in late-replicating regions (e) or in 
regions with higher recombination rate (f), which have been shown to be 
hypermutable. 
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Extended Data Figure 4 | Non-coding mutations in exome-positive 
probands and poorly evolutionarily conserved sites make a minimal 
contribution to severe developmental disorders. a, In the 1,691 ‘exome- 
positive’ probands, there is no evidence for a burden of DNMs in any of the 
non-coding element classes tested. Red diamonds indicate the observed 
counts, while black circles and bars indicate the expected count and 95% 


CI, respectively. b, DNMs in exome-negative probands show a greater 
degree of evolutionary conservation (measured by PhyloP score) than 
DNMs in exome-positive probands in two classes: fetal brain-active CNEs 
(median 1.57 exome-positive, 2.85 exome-negative, n = 368 mutations) 
and missense changes (median 3.43 exome-positive, 3.98 exome-negative, 
n= 6,244 mutations). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


ARTICLE 


Heart Enhancers VISTA Enhancers Conserved Noncoding Elements 
HeartPheno No Heart Pheno Fetal Brain Active Fetal Brain Inactive Fetal Brain Active Fetal Brain Inactive 
Neurodev Pheno No Neurodev Pheno Neurodev Pheno No Neurodev Pheno 


Conserved Fetal Brain Active Enhancers 


‘ 
8 ‘ 
‘ 
= 
- ss p= 0.01 
‘ 
] 
‘ 
‘ 
‘ 
= Ss 
g 2 
5 ‘ 
‘ 
= ' 
] 
4 ‘ 
‘ 
é 
“n 8 . 
] 
] 
‘ 
‘ 
) 
‘ 
o 
ee | 
0 4 8 12 16 2 24 2 32 3% 40 44 48 52 56 
Number of DNMs 
Extended Data Figure 5 | Hypothesis test enumeration and enrichment based on element class and phenotype in this analysis. b, Evolutionarily 
for mutations in highly conserved fetal brain-active enhancers. a, We conserved fetal brain-active enhancers (n = 106) are enriched for DNMs in 
corrected for thirteen tests in order to account for the nested hypotheses exome-negative probands. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


Fetal Brain Hi-C - 0.17 
Agreement Between 
_ Gene Target Predictions 
DHS Correlation - 1.00 
A 0.50 
synteny 0.00 
Closest Gene - 
wer ss gs? 
A> Se 
or ee a ot 
C Brain Developmental Expression Trajectory 
Prenatal (Post-Conception Week) 
c 175- 
2 
& 1.50 - 
) 
@ 125- 
F 
6 1.00- 
2 v 
0.75 - ' 1 ' ' ‘ 


10 20 30 
Prenatal Age (week) 


Extended Data Figure 6 | Gene target prediction for targeted non- 
coding elements. Pairwise concordance between four different gene 
target prediction methods is low. Using predicted targets from fetal brain 
Hi-C data, elements with an observed DNM in exome-negative probands 


ARTICLE 


Fetal Brain Hi-C 
DDG2P_ - = 
LoF-constrained - 3s 
@ all elements 
© elements w/ DNMs 
Brain Enriched - ——:— 
Cortex Expressed - =o 
0.0 0.1 0.2 0.3 0.4 


Proportion of Fetal Brain Active CNEs Predicted to Contact Gene 


Postnatal (Years) 


~~ Hi-C:DNM— 


20 
Postnatal Age (year) 
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implicated in neurodevelopmental disorders. Dots and bars represent the 
point estimate and 95% confidence interval, respectively. 
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Encoding of danger by parabrachial 


CGRP neurons 


Carlos A. Campos!, Anna J. Bowen!, Carolyn W. Roman! & Richard D. Palmiter! 


Animals must respond to various threats to survive. Neurons that express calcitonin gene-related peptide in the 
parabrachial nucleus (CGRP?®% neurons) relay sensory signals that contribute to satiation and pain-induced fear 
behaviour, but it is unclear how they encode these distinct processes. Here, by recording calcium transients in vivo 
from individual neurons in mice, we show that most CGRP?®% neurons are activated by noxious cutaneous (shock, heat, 
itch) and visceral stimuli (lipopolysaccharide). The same neurons are inhibited during feeding, but become activated 
during satiation, consistent with evidence that CGRP?®" neurons prevent overeating. CGRP?®" neurons are also activated 
during consumption of novel foods or by an auditory cue that has previously been paired with electrical footshocks. 
Correspondingly, silencing of CGRP’®" neurons attenuates the expression of food neophobia and conditioned fear 
responses. Therefore, in addition to transducing primary sensory danger signals, CGRP?® neurons promote affective- 
behavioural states that limit harm in response to potential threats. 


The parabrachial nucleus (PBN) receives ascending inputs from 
many sensory modalities and relays this information to brain areas 
that transform sensory signals into autonomic, affective, and behav- 
ioural responses!?. We have focused on calcitonin gene-related 
peptide (CGRP)-expressing neurons in the external lateral PBN, 
initially because disinhibition of these neurons following ablation of 
hypothalamic neurons that express orexigenic agouti-related peptide 
(AgRP) causes severe anorexia®, Subsequently, we found that activa- 
tion of CGRP?®N neurons contributes to meal termination‘, indicating 
that these neurons play a physiological role in controlling appetite. 
Additional studies have shown that CGRP?8N neurons are directly acti- 
vated by neurons that reside in the nucleus of the solitary tract (NTS)°, 
which receives visceral satiety signals via vagal sensory afferents®. 
Notably, CGRP?®N neurons are also part of a spino-parabrachial 
pathway that is critical for eliciting fear responses to cutaneous pain’. 
CGRP?8N neurons have axonal projections that activate the central 
amygdala and the bed nucleus of the stria terminalis (BNST)’, fore- 
brain nuclei that have been implicated in both feeding and pain- 
related sensory processing’ -’. These observations raised the question 
of whether subpopulations of CGRP?®N neurons relay specific stimuli 
to the forebrain. 

We hypothesized that CGRP?®N neurons perform a common func- 
tion of signalling danger, whether it relates to cutaneous pain or con- 
sumption of food beyond physiological capacity, and reasoned that 
visualization of the calcium dynamics (correlates of neural activity) of 
individual neurons in vivo'®"' would allow us to investigate whether 
subpopulations of CGRP?2N neurons respond to specific stimuli. We 
show that the same neurons respond to feeding and various noxious or 
painful stimuli. CGRP?® neurons also respond to danger signals that 
require learning and memory. Together with loss-of-function studies 
using transgenic and viral techniques, these results show that activation 
of CGRPP3N neurons contributes to affective-behavioural states that 
limit harm. 


Coding of painful stimuli by CGRP?®" neurons 


We injected adeno-associated virus (AAV) expressing Cre-dependent 
GCaMP6m into the lateral PBN of transgenic mice with Cre 


recombinase targeted to the CGRP-encoding gene (Calca“”’*), thereby 
introducing a calcium-sensitive fluorophore specifically into CGRP?2N 
neurons (Fig. 1a). This strategy allowed us to record calcium dynamics 
from individual neurons in mice implanted with a graded-index 
lens (Fig. 1b, c). A tail pinch transiently increased calcium activity 
in CGRP?®N neurons (Fig. 1d and Supplementary Video 1); all 384 
neurons tested responded to at least four out of five tail pinches during 
a single recording session. Most neurons exhibited increased calcium 
activity following a pinch to the tail or any of the four paws (Fig. le, f). 
To determine whether CGRP’? neurons are activated by noxious 
stimuli on the face, we placed a warm metal rod on the lip; this acti- 
vated the same neurons that responded to tail pinch (Extended Data 
Fig. 1a, b). Intraperitoneal injection of lipopolysaccharide (LPS), which 
mimics bacterial infection and causes visceral malaise, also activated 
pinch-responsive neurons (Fig. 1g, h and Extended Data Fig. 1c). Thus, 
individual CGRP?®N neurons are activated by noxious stimuli applied 
to all body parts examined, consistent with studies of electrical activity 
from unidentified PBN neurons’? 

To determine whether CGRP?8N neurons code for the intensity 
of noxious stimuli, we placed the tails of unconscious mice into a 
temperature-controlled water bath. Neurons were unresponsive to 
temperatures below 44°C, but most neurons were activated at 46°C 
and above (Fig. 1i). The calcium activity of CGRP?8N neurons increased 
with increasing temperatures (Fig. 1j and Extended Data Fig. 2a—e), as 
found previously for unidentified PBN neurons)”. Likewise, CGRP?BN 
neurons showed an increase in calcium activity as the intensity of elec- 
trical shocks delivered to the tail increased (Extended Data Fig. 2f). In 
awake mice, neurons also exhibited an increase in activity in response 
to increasing temperatures during a hotplate test (Extended Data 
Fig. 2g, h), although the mice were able to mitigate some of the pain by 
alternating the paws that were in contact with the hotplate. We conclude 
that CGRP?®N neurons encode stimulus intensity. 

In awake and anaesthetized mice, electrical shock induced a rapid 
increase in the calcium activity of most CGRP?®N neurons (Fig. 1k, 1 
and Extended Data Fig. 1d, e). We also measured calcium activity 
in a neighbouring neuronal population that controls fluid intake— 
oxytocin receptor-expressing neurons in the dorsal lateral PBN“. 


1Howard Hughes Medical Institute and Department of Biochemistry, University of Washington, Seattle, Washington 98195, USA. 
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Figure 1 | CGRP?®N neurons are activated by painful stimuli. 

a, Schematic*” showing configuration for imaging of PBN CGRP 
neurons. b, Mouse with the microscope attached. c, Individual neurons 
identified with software. Coloured circles correspond to traces in d. 

d, Calcium fluorescence activity of neurons outlined in c recorded during 
tail pinch. e, f, The same neurons are activated by pinching the tail or any 
four paws of anaesthetized mice (n =5 mice, 384 neurons). g, Calcium 
activity following intraperitoneal injection of LPS or saline in awake 
mice (m= 6 mice, 324 neurons saline, 372 neurons LPS); data points are 
individual neurons and lines are means (two-way ANOVA, Bonferroni’s 
post-hoc). Baseline was 5-min period before injection. h, Percentage 


Electrical shock did not rapidly affect the activity of those neurons 
(Extended Data Fig. le). The time-locked response of CGRPPBN 
neurons, but not of neighbouring PBN neurons, further suggests that 
CGRP?®N neurons are a primary relay for pain signals’. 


Response to non-painful noxious stimuli 

Pain can signal ongoing harm, whereas itch is an adaptive response 
that may lead to avoidance of harmful plants and insects!°. Although 
itch-responsive sensory neurons overlap with those that respond to 
painful stimuli, little is known about brain circuits that contribute to 
itch-related behaviours. We monitored calcium activity in CGRP?8N 
neurons in anaesthetized mice that received a subcutaneous injection 
of chloroquine, a drug that causes itching'®. Chloroquine injection 
increased calcium activity in most CGRP’®N neurons (Fig. 2a). 

To determine whether activation of CGRP?®N neurons is required 
for chloroquine-induced scratching, we injected an AAV expressing 
Cre-dependent tetanus toxin light chain (AAV-DIO-GFP:TetTox) 
bilaterally into the PBN of Calca”'* mice (Fig. 2b). TetTox cleaves 
synaptobrevin, a synaptic vesicle protein that is necessary for neuro- 
transmitter release'’, thereby preventing neurotransmitter release from 
Cre-expressing neurons”!*. Compared to mice injected with a GFP 
control virus, chloroquine-induced scratching was attenuated in mice 
in which CGRP?®N neurons were silenced (Fig. 2c, d). We examined 
whether activation of CGRP? neurons also contributes to scratching 
induced by a non-chemical stimulus by placing adhesive stickers on the 
necks of mice (Fig. 2e). This manipulation activates mechanoreceptors 
and elicits scratching and paw swipes to remove the sticker, but is not 
thought to be painful’’. TetTox silencing of CGRP?®N neurons attenu- 
ated sticker-removal attempts (Fig. 2f). Therefore, CGRP?5N neurons 
can be activated by non-painful noxious stimuli and their activation 


618 | NATURE | VOL 555 | 29 MARCH 2018 


-1 0 41 
Time (s) 


of cells activated by LPS and tail pinch. i, Percentage of cells activated 
following placement of tail in heated water bath (40-52°C) for 20s (n=5 
mice, 286 neurons; one-way repeated measures ANOVA, Tukey’s post- 
hoc). j, Time course of calcium activity for temperature gradient study; 
baseline was 10-s period before water exposure. k, Calcium activity time 
course in response to footshock (0.5 mA, 0.5 s; dashed line) in awake 
mice (n =5 mice, 317 neurons); baseline was 2-s period before shock. 

1, Plot shows activity of neurons (rows) from a representative mouse that 
received a footshock. Line graphs are mean +s.e.m. (shaded region). 
AF/F = (F — Foaseline)/Foaseline. ***P < 0.001. For statistical analysis, see 
Supplementary Information. 


is required to elicit appropriate behavioural responses. These find- 
ings identify a specific population of PBN neurons that contributes to 
itch-related behaviour”. 


CGRP neuron activity during feeding 
Because CGRP?3N neurons contribute to satiation’, we investigated 
whether their activity was modulated by feeding-related stimuli in 
mice that had been deprived of food overnight and then given access 
to their familiar chow pellets. Calcium activity in CGRP?® neurons 
was rapidly inhibited when food was presented (Supplementary 
Video 2), even before the mice had taken the first bite (Fig. 3a, b and 
Extended Data Fig. 3a). We also observed rapid decreases in calcium 
activity before bites (Fig. 3c and Extended Data Fig. 3b). The inhibitory 
responses were observed in nearly all neurons (Extended Data Fig. 3c, 
d). To ensure that these effects were not due to movement artefacts, 
we analysed fluorescence in regions of interest lacking neurons; here 
we observed no changes before or during feeding (data not shown). 
We also analysed calcium recordings using constrained non-nega- 
tive matrix factorization (CNMF)*". Individual traces extracted from 
CNMF analysis show that calcium transients are not synchronized, but 
all neurons are similarly inhibited upon presentation of food and before 
bites (Fig. 3d, e and Extended Data Fig. 3e, f). As activation of CGRPPBN 
neurons suppresses appetite’, their inhibition should support feeding. 
We also examined calcium activity as feeding progressed. Food 
deprivation lowered the activity of CGRP?8N neurons relative to 
the sated state (Fig. 3f), consistent with increased inhibitory input 
from AgRP-expressing neurons, which are activated during food 
deprivation”, and/or decreased excitatory input from anorexigenic 
neurons in the NTS*». After 30 min of feeding, neurons became more 
active (Fig. 3f), which corresponded to a decrease in food intake 
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Figure 2 | CGRP? neurons are activated by non-painful noxious 
stimuli. a, Calcium fluorescence activity in anaesthetized mice that 
received subcutaneous injection of chloroquine (8 mM, 10 11) or saline 
control; baseline was 2-min period before injection. Data points are 
individual neurons and horizontal lines are means (n = 4 mice, 231 
neurons chloroquine, 216 neurons saline; two-way ANOVA, Bonferroni's 
post-hoc). b, Schematic”” showing bilateral PBN injection of Calca“"’'* 
mice with AAV-DIO-TetTox:GFP to prevent neurotransmitter release 
from CGRP neurons or AAV-DIO-GFP control virus. c, d, Quantification 
of chloroquine-induced scratching bouts in GFP and TetTox mice 10 

(P= 0.021) and 20 min (P= 0.011) after injection. e, Placement of sticker 
on neck area of mouse. f, Quantification of sticker removal attempts 
during the 5 min after placing the sticker on GFP and TetTox mice. 
Graphs (c, d, f) show data from individual mice (n = 8 per group; Student's 
t-test, two-tailed); lines represent the mean. *P < 0.05, ***P < 0.001. For 
statistical analysis, see Supplementary Information. 


compared to the first 30 min (0-30 min, 0.71 + 0.04 g; 30-60 min, 
0.22 + 0.02 g (mean + s.e.m.)). In general, these neurons became more 
active as feeding progressed (Fig. 3g), compared to either non-fasted 
or fasted baseline conditions (Fig. 3h). All neurons that were analysed 
in the feeding study also responded to tail pinch (data not shown). 


Coding of food neophobia 

Given that CGRP?® neurons respond to anticipatory signals before 
food consumption, we investigated whether they respond to other 
complex sensory inputs by examining the neophobia response to a 
novel food, an adaptive trait that limits the consumption of unfamiliar, 
potentially toxic foods. In contrast to their response to the familiar 
diet, CGRP?®N neurons showed increased calcium activity when mice 
were presented with an unfamiliar, high-fat food pellet (Extended Data 
Fig. 4a-f and Supplementary Video 3). CGRP?® neurons were also 
activated following each bite during the first trial (Fig. 4a and Extended 
Data Fig. 4g). However, after mice had become acclimated to the unfa- 
miliar food, CGRP?8N neurons became inhibited before and during 
consumption of the palatable diet (Fig. 4b, c and Extended Data 
Fig. 4h, i), as during consumption of their normal diet. The change in 
the pattern of calcium activity corresponded with a decreased latency 
to begin consumption during the last trial (20.4 + 4.1 s; mean + s.e.m.) 
compared to the first trial (115.2 + 16.0s). We also monitored 
calcium activity in response to an inedible novel object (a marble). 
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CGRP? neurons were activated upon first presentation of the object 
(Supplementary Video 4), a response that dissipated as mice became 
acclimated to the object (Extended Data Fig. 5). After acclimation, 
neuronal activity did not decline as with the palatable diet, probably 
because food is appetitive whereas the marble becomes neutral after 
acclimation. 

We examined the necessity of CGRP?®N neurons for behavioural 
expression of food neophobia by silencing them with TetTox (Fig. 4d). 
Compared to control mice, TetTox-injected mice exhibited an attenu- 
ated food neophobia response, which was apparent during the first two 
exposures to a novel palatable diet (Fig. 4e), indicating that CGRPPBN 
neurons contribute to food neophobia. This result is consistent with 
attenuation of food neophobia by lesions of the lateral PBN”. 


Reactivation of CGRP?®’ neurons by fear memory 

With food neophobia, fear behaviour dissipates after repeated exposure, 
suggesting that a positive memory is formed and that CGRP?®N 
neurons are sensitive to learning processes. We postulated that 
CGRP®N neurons might also respond to a negative memory, such 
that exposure to a stimulus paired with a painful event would lead to 
activation of CGRP?® neurons by that conditioned stimulus. 

Mice were subjected to footshock-tone conditioning, in which mice 
learn to associate a neutral stimulus (tone) with an electrical footshock 
(Fig. 5a). These mice formed a fear memory, made evident by the 
fact that the tone alone was sufficient to elicit freezing (a rodent fear 
response) the next day (Fig. 5b). After conditioning, CGRP?®N neurons 
were activated during the 10-s tone presentation (Fig. 5c and Extended 
Data Fig. 6a). Neuronal activation became less robust during extinction 
with repeated presentations of the tone in the absence of footshock 
(Fig. 5d, e and Extended Data Fig. 6a), which corresponded with a 
decrease in freezing behaviour (Fig. 5b). Mice that did not receive 
footshocks with the tone during conditioning did not exhibit tone- 
induced freezing or increased calcium activity in CGRP?®N neurons 
(Fig. 5f-j and Extended Data Fig. 6b). We also investigated whether 
non-specific contextual cues could activate CGRP?®N neurons in mice 
that underwent footshock fear conditioning (Extended Data Fig. 6c). 
Compared to test trials before conditioning, placing mice in the shock 
chamber after conditioning was sufficient to increase calcium activity 
in CGRP?®N neurons (Extended Data Fig. 6d-g). Therefore, in addition 
to being a primary relay of pain signals that are necessary and sufficient 
to form a fear memory’, CGRP?5N neurons also become activated 
during recall of that aversive memory. 

To investigate whether activation of CGRP?8 neurons contributes 
to learned behavioural fear responses, we subjected mice to footshock 
fear conditioning (Fig. 6a) followed by CGRP’? neuron silencing 
with TetTox after the fear memory had been formed (Fig. 6b). We used 
greater shock intensities than in the previous experiment to form a 
more robust fear memory because extinction training occurred weeks 
after conditioning, to allow sufficient time for TetTox expression. 
Compared to GFP-injected control mice, mice in which CGRPPBN 
neurons were silenced rapidly extinguished their freezing response 
during repeated presentations of the tone without footshocks (Fig. 6c). 
By the second fear-extinction-training session, fear-conditioned TetTox 
mice exhibited minimal freezing behaviour (Fig. 6d), similar to control 
mice that did not undergo fear conditioning (Fig. 6e-h). Conversely, 
fear-conditioned GFP-control mice continued to freeze in response 
to the tone during the second extinction-training session (Fig. 6d). 
Thus, activation of CGRP?®N neurons contributes substantially to fear 
responses during recall of an aversive memory. 


Discussion 

Here we show that CGRP?® neurons become activated in response to 
danger signals of diverse sensory modalities—visceral and cutaneous 
pain (chemical, mechanical, thermal and electrical), itch, and meal- 
related satiety signals—that involve various ascending sensory 
pathways (trigeminal, spinal and vagal). CGRP”®N neurons are also 
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Figure 3 | Rapid control of CGRP?®N neurons before and during 
feeding. a, Average calcium fluorescence activity from a representative 
mouse (63 neurons) relative to chow being available (dashed line) and 
each bite taken (red arrows) during a fast-refeed study. b, Calcium activity 
relative to placement of chow in cage (n =5 mice, 282 neurons). Data 

are mean + s.e.m. (shaded region). c, Calcium activity relative to taking 

a bite. Multiple bites were analysed and averaged for each neuron (n=5 
mice, 282 neurons). Data are mean + s.e.m. (shaded region). d, e, Forty- 
seven neuron traces from a representative mouse extracted using CNMF 
analysis, relative to chow availability (d) and to a bite (e). f, Calcium 


activated by danger signals that require cognition, as shown by the 
food-neophobia and fear-conditioning assays, and TetTox loss-of- 
function studies reveal that these neurons contribute to both innate 
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Figure 4 | CGRP?®N neurons are activated in response to a novel and 
palatable high-fat diet. a—c, Calcium fluorescence activity in response 
to taking a bite from a novel, high-fat diet (HFD) pellet. Line graphs 

are means + s.e.m. (shaded region) for raw AF/F values and plots show 
normalized AF/F values for neurons (rows) of all experimental mice 
(trial 1, n=5 mice, 247 neurons; trial 2, n =5 mice, 244 neurons; trial 3, 
n=5 mice, 254 neurons). Test trials were conducted on consecutive days. 
d, Schematic*’ showing bilateral PBN injection of Calca’”’* mice with 
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activity after refeeding. Data are normalized to time-point 0 min, which 
was a 60-s baseline before food availability; ‘no fast’ data were recorded 
24h before fast. Data points are individual neurons and the lines are 
means (n=5 mice, 282 neurons; one-way repeated measures ANOVA, 
Tukey’s post-hoc). Dissimilar letters above columns of data points indicate 
statistical differences between time points. g, Calcium activity of neurons 
from a representative mouse after refeeding. h, Percentage of neurons 
activated relative to fasted or non-fasted baseline conditions. For statistical 
analysis, see Supplementary Information. 


and learned fear behaviours. Thus, CGRP?®N neurons convey 
negative valence, while parallel pathways are likely to provide infor- 
mation regarding sensory modality and stimulus origin” to help elicit 
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AAV-DIO-TetTox:GFP (to silence CGRP neurons) or AAV-DIO-GFP 
control virus. e, Non-food-deprived mice received 20 min access to a novel 
HED pellet on four consecutive days; data points are individual mice (eight 
mice per group; two-way repeated measures ANOVA, Bonferroni's post- 
hoc). Baseline was a 60-s period before HFD pellet was presented. 

*** P< 0.001, *P < 0.05. For statistical analysis, see Supplementary 
Information. 
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Figure 5 | CGRP? neurons are activated during recall of a pain 
memory. a, Fear-conditioning paradigm in which mice received a 
footshock (0.3 mA; 0.5s duration) that co-terminated with a 10-s tone 
(five pairings). b, Percentage of time spent freezing (fear-conditioned 
mice) in response to 10-s tone presentations during the extinction training 
sessions. c-e, Calcium fluorescence activity of neurons in fear-conditioned 
mice during presentation of tone trials 1, 15, and 30. f, Sham-conditioning 
paradigm. g, Percentage of time spent freezing (sham-conditioned mice) 


appropriate responses. In addition to the modalities examined here, 
we hypothesize that CGRP?8% neurons would respond to gustatory”, 
olfactory’’, auditory”®, vestibular”, and other visceral stimuli*®. 

Calcium recordings show that CGRP?®N neurons as a population 
are activated or inhibited in response to (putative) salient stimuli. 
This arrangement is also observed in AgRP ‘hunger’ neurons”; in 
that case, separate projections to distinct brain regions contribute 
to various aspects of hunger-related behaviour and physiology*!~*?. 
CGRP?®N neurons project to the central amygdala and BNST*”, nuclei 
that mediate behavioural and physiological responses to ingestive and 
fear-related stimuli”***, Downstream central amygdala neurons express 
the CGRP receptor’, and a subpopulation of those neurons co-express 
protein kinase C-&*>, but the identity of downstream BNST neurons is 
unknown. We postulate that the central amygdala and BNST contribute 
to different aspects of behaviour and/or alter behaviours over different 
timescales. Behaviours may also be tuned by the magnitude and dura- 
tion of CGRP?®N neuronal activation, which could lead to differential 
activation of postsynaptic targets. 

Although we focused here on aversive stimuli, pain-responsive 
CGRP?®N neurons also respond to feeding-related stimuli. CGRP?2N 


in response to 10-s tone presentations during the extinction training 
sessions. h-j, Calcium activity of neurons in sham-conditioned mice 
during presentation of tone trials 1, 15, and 30. Fear-conditioned mice, 
n=4 mice, 214 neurons; sham-conditioned mice, n = 4 mice, 169 neurons. 
Line graphs are mean + s.e.m. (shaded region). Baseline was 5-s period 
before tone presentation. For statistical analysis, see Supplementary 
Information. 


neurons are rapidly inhibited before food consumption, indicating that 
there is bidirectional control of their activity and revealing that they 
also participate in a normal, non-aversive behaviour. Inhibition from 
AgRP neurons or decreased input from anorexigenic neurons in the 
NTS? is likely to dampen CGRP”®N neuron activity during fasting’, 
but the decreases in calcium activity during food consumption are 
unlikely to come from AgRP neurons because they are inactive during 
feeding’**°. Because CGRP?®N neurons transmit potential danger 
signals, their inhibition during starvation might contribute to adaptive 
behaviours, such as increased risk taking to acquire food*!, in addition 
to permitting increased food intake. 

Numerous forebrain projections to the PBN*”8 are candidates for 
the control of CGRP?®N neurons in response to sensory signals that 
require complex processing during neophobia and learning. Notably, 
fear-conditioned mice with either intact or inhibited CGRP?®N neu- 
rons showed similar behavioural responses during the first tone trial of 
extinction training, but the TetTox-treated mice showed faster extinc- 
tion of their fear memory. This result suggests that activation of other 
brain areas is initially sufficient to elicit a learned fear response, but that 
CGRP?®N neuronal activation is critical for maintaining a persistent 
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Figure 6 | Silencing of CGRP?®N neurons attenuates fear response 
during recall of pain memory. a, Fear-conditioning paradigm in which 
mice received a footshock (0.6 mA; 0.5s duration) that co-terminated 
with a 10-s tone (ten pairings). b, Timeline for experiments and surgeries 
of mice that underwent fear conditioning. After fear conditioning, mice 
received bilateral PBN injections of AAV-DIO-TetTox:GFP to inhibit 
CGRP neurons or AAV-DIO-GFP control virus. c, d, Percentage of 

time spent freezing (fear-conditioned mice) in response to 10-s tone 


presentations during extinction sessions (n = 12 per group; two-way 
repeated measures ANOVA). e, Sham-conditioning paradigm. f, PBN 
virus injections after sham conditioning. g, h, Percentage of time spent 
freezing (sham-conditioned mice) in response to 10-s tone presentations 
during extinction training sessions (n = 6 per group). Line graphs are 
mean + s.e.m. ***P < 0.001. For statistical analysis, see Supplementary 
Information. 
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fear state and retarding extinction. Current fear-extinction hypotheses 
suggest that extinction of a fear memory involves weakening of the 
original associative memory and/or consolidation of a newly acquired 
‘safe’ memory that overrides the original fear memory*”. Our findings 
are congruent with either interpretation, as activation of CGRPPBN 
neurons could strengthen the original fear memory or preclude 
formation of a safe memory. 

We expect that future studies will reveal how CGRP?®N neurons are 
rapidly controlled by various neuronal inputs and shed light on how 
core brainstem circuits are recruited to produce complex behaviours. 
These endeavours may ultimately lead to a better understanding of how 
CGRP?®N neurons alter affective-motivational behaviours associated 
with transient’ and chronic threats*°, and mental disorders that might 
involve dysregulation of upstream neurocircuitry. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


All experimental protocols were conducted according to US National Institutes of 
Health guidelines for animal research and approved by the Institutional Animal 
Care and Use Committee at the University of Washington. 

Animals. Heterozygous Calca“’*+ mice and Oxtr“'!+ mice (C57BI/6 background) 
were generated and maintained as previously described*'*. Male mice, 3-6 months 
of age, were used for all studies. Mice from each litter were randomly assigned 
into treatment groups. Following stereotaxic surgery, mice were singly housed 
for at least three weeks before and during experimentation, with ad libitum access 
(unless noted otherwise) to standard chow diet (LabDiet 5053) in temperature- and 
humidity-controlled facilities with 12-h light-dark cycles. Behavioural experiments 
were conducted during the light cycle, starting at approximately 3h into the light 
cycle. No statistical methods were used to predetermine sample size. Unless stated 
otherwise, the experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 

Virus production. AAV1-CBA-DIO-GCaMP6m, AAV 1-CBA-DIO-GFP:TetTox, 
and AAV1-EFla-DIO-GFP viral vectors were produced by transfecting HEK 
cells and then purifying cell extracts by pelleting through sucrose and by CsCl 
gradient ultracentrifugation. Final pellets were suspended in 0.1 M PBS. Viruses 
were titred as described’. In pilot experiments, we injected varying concentrations 
of AAV1-CBA-DIO-GCaMP6m, examined histological sections four weeks after 
virus injection, and then chose the highest virus titre that produced GCaMP6m 
expression that was limited to the soma. 

Stereotaxic surgeries. All mice were anaesthetized with 1.5 to 2.0% isoflurane 
(mixed with 0.8 | per min of oxygen) for stereotaxic surgeries and virus injections. 
For calcium imaging, Calca“""* mice received a unilateral PBN injection of AAV1- 
DIO-GCaMP6m (0.511). Virus was injected using a glass capillary attached to a 
Nanoject (Drummond, #3-00-204) using the following coordinates: —4.65 mm 
posterior to bregma, —1.30 mm lateral from midline, and —2.90 mm ventral to 
dura. These coordinates were verified with Fluoro-Gold (Sigma-Aldrich, #39286) 
injections in non-experimental mice, which produced Fluoro-Gold labelling that 
was restricted to the lateral PBN. AAV1-DIO-GFP and AAV 1-DIO-GFP:TetTox 
viruses (for loss-of-function studies) were injected bilaterally. 

Mice were prepared for calcium imaging as described*". In brief, three 
weeks after AAV1-CBA-DIO-GCaMP6m virus injection, Calca“*'+ mice were 
anaesthetized (as described above) and implanted with a microendoscope lens 
(6.1mm length, 0.5 mm diameter; Inscopix, #100-000588) with the assistance 
of a ProView implant kit (Inscopix, #100-000754) that allowed us to visual- 
ize fluorescent activity during the lens implant. The lens was targeted to be ~ 
200-300 1m above the neurons using the following coordinates: —4.80 mm 
posterior to bregma, —1.70 mm lateral from midline, and —3.20 to —3.40mm 
ventral to skull surface. Some lenses were purposely implanted deeper to record 
from ventral CGRP?® neurons. To visualize oxytocin receptor neurons, the lens 
was targeted more dorsal and medial: —4.80 mm posterior to bregma, —1.40 mm 
lateral from midline, and —3.00 mm ventral to skull surface. One week after 
lens implantation, mice were anaesthetized and a baseplate (Inscopix, #100- 
000279) was implanted above the lens. The baseplate provides an interface for 
attaching the miniature microscope during calcium-imaging experiments, but 
during other times a baseplate cover (Inscopix, #100-000241) was attached to 
prevent damage to the microendoscope lens. Post-hoc histological analysis showed 
variability in the lens placement relative to the rostral-caudal extent of the PBN, 
suggesting that CGRP?® neurons in different parts of the PBN were sampled 
(Extended Data Fig. 7). Out of approximately 60 mice that were injected with 
GCaMP6m virus, 20 had successful lens implantation and were used for studies. 
Calcium imaging studies. All calcium imaging was recorded at 5 frames per sec- 
ond, 200-ms exposure time, and 10-30% LED power using a miniature microscope 
from Inscopix (nVista). The recording parameters were based on pilot studies that 
demonstrated minimal photo-bleaching using these settings. In our pilot studies 
we also recorded calcium activity with higher frame rates (10 and 15 frames per 
second), but we were unable to find an optimal virus titre that produced sufficient 
fluorescent activity to record at high frame-rates while maintaining the ability to 
discriminate individual neurons. CGRP?®% neurons are densely packed in a small 
brain area and higher GCaMP6m virus titres produced dense dendritic labelling 
that overlapped with the soma of neighbouring neurons. Ethovision (Noldus, XT 
10) was used to trigger and synchronize behavioural video recordings with calcium 
recordings. 

Order of experiments. In one cohort of Calca mice (n=5), fast and refeeding 
studies were conducted first, followed by neophobia studies, and then studies 
involving noxious stimuli (pinch, shock, LPS, itch, and temperature studies). 
A second cohort of naive Calca“"’’*+ mice were used for fear conditioning studies 
(n=4 per group). A third group of Calca“®'* mice (n =3) were used to examine 
graded shock intensities. Oxtr“*'+ mice (n= 3) were used for a shock study. 
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Crel+ mice (n= 3) were used for the hotplate test and con- 


A fourth group of Calca 
textual fear conditioning. 

Fasting-refeeding. Feeding studies (including food neophobia experiments) 
were conducted before the pain studies. Mice were acclimated to handling and 
attachment of the miniature microscope for approximately one hour per day for 
two weeks. After the acclimation period, mice were food deprived overnight, and 
a standard chow pellet (their normal food) was placed on the floor of their cage 
following a 1-min baseline recording period. Calcium activity was recorded con- 
tinuously for 5 min after receiving access to food, followed by 2-min duration 
recordings 10, 20, 30, 40, 50, and 60 min after refeeding. For analysis of fluorescent 
activity relative to taking a bite from the food pellet, we analysed bites during the 
first 2 min of fasting-refeeding, and averaged these values for each neuron. The 
timing of bites was confirmed visually using the behavioural videos. 

Food neophobia. After a baseline recording period (1 min), non-food-deprived 
mice were given access to a novel, palatable food pellet (45 kcal% high-fat diet; 
Research Diets, #D 12451). Calcium activity was recorded continuously for 10 min 
after the onset of access to the novel food. Mice had access to the food pellet for 
20min. Bites were confirmed using visual confirmation of the behavioural videos. 
Test trials were conducted on consecutive days. 

Novel object. We used the same experimental design as with the food neophobia 
studies, except a dark blue marble (1 cm diameter) was placed in the cage instead 
of a novel food pellet. 

Cutaneous pain. For analysis of calcium activity in response to pinch, mice were 
anaesthetized with 1.5-2.0% isoflurane (mixed with 0.8 | per min oxygen). Once 
the calcium activity was stable under baseline conditions, mice received five tail 
pinches using forceps (5-s duration, 2-min interval between pinches). In a separate 
recording session, we analysed calcium activity in response to pinch of the tail 
and paws (5-s duration, 2-min interval between pinches). In another recording 
session, the tail was placed in a temperature-controlled water bath ranging from 
40°C to 52°C (20-s duration, 3-min interval between each temperature gradient). 
Calcium responses to the tail being placed in 56°C and 60°C water (10-s dura- 
tion, 5-min interval between trials) were examined on a separate day. To examine 
calcium responses to facial stimuli, a heated metal rod (probing surface area 
~1 cm?) at ~52°C (the metal rod was in 52°C water for 10 min) was placed on 
the lip area for 20s. Pinches and thermal stimuli were manually synchronized to 
calcium recordings by applying the stimulus at a designated time during the record- 
ing. Mice underwent a 0.5-mA footshock (0.5-s duration) that was synchronized 
to the calcium imaging recordings by triggering the shock with the calcium imag- 
ing acquisition software (Inscopix, nVista HD) and Ethovision (Noldus, XT 10). 
For tail shock, electrode gel (Spectra 360, Parker Laboratories) was placed on 
the tail and the positive and negative leads (18-ga copper wire) from an electri- 
cal stimulator (model #A320D, World Precision Instruments) were placed 1 cm 
apart. A transistor-transistor logic (TTL) pulse to the stimulator was used to 
synchronize calcium recordings with delivery of an electrical current (2-6 mA, 
2s duration). The hotplate test involved removing the mouse from its home cage 
and placing it onto a pre-heated aluminium plate (15 x 15cm) enclosed within a 
transparent Plexiglas chamber that prevents the mouse from escaping (Coulbourne 
Instruments). We examined temperatures between 40°C and 56°C (20-s duration, 
3-min interval between each temperature gradient, animal placed in home cage 
during 3-min interval). Except for the footshock and hotplate studies, all experi- 
ments were conducted under anaesthesia. Recording sessions were conducted once 
every other day. 

Visceral pain. After a 5-min baseline recording period, awake and freely moving 
mice received an intraperitoneal injection of lipopolysaccharide (LPS; 50 1g/kg; 
Calbiochem, #437650). Calcium activity was recorded 0.5, 1, 2, 3, and 4h after 
injection (2 min duration for each time point). Using a similar experimental design, 
mice received a control saline (0.9%) injection on a separate day. 
Chloroquine-induced itch. Mice were anaesthetized with 1.5-2.0% isoflurane 
(mixed with 0.8 1 per min oxygen). Once the calcium activity was stable under 
baseline conditions, mice received either a chloroquine (1011; 8 mM dissolved in 
0.9% saline; Acros Organics) or saline control injection with a Hamilton syringe 
(26 s ga; #204001). After injection, calcium activity was monitored for 10s every 
minute for 10 min. 

Footshock fear conditioning. A separate cohort of animals was used for the 
fear-conditioning studies. After acclimation to handling and to the miniature 
microscope, mice underwent footshock fear conditioning in shock chamber 
(Med Associates), which involved five cue pairings (3 kHz, 80 dB, 10-s duration) 
that co-terminated with a footshock (0.3 mA, 0.5-s duration, randomized inter- 
trial interval with a range of 70-140s and average of 90s). Sham-conditioned 
control mice did not receive a footshock. One week later, mice were fitted with 
the miniature microscope for calcium recordings, and underwent fear extinction 
in a different but similar-sized chamber (white walls and floor instead of shock 
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chamber that had Plexiglas walls and shock grid floor). During the extinction 
training session, mice received 30 tone cues (in the absence of footshock) using 
the same intertrial interval parameters as during conditioning. A TTL pulse to the 
shock chamber (Med Associates) was used to trigger and synchronize tones with 
calcium recordings. Time spent freezing (during the tone), defined as immobility 
up until any movement of the head or body, was manually scored with a stop- 
watch (experimenter was blind to treatments). For contextual fear conditioning, 
mice were placed in the shock chamber for 15 min on two consecutive days before 
conditioning to measure baseline calcium activity. On the third day, mice were 
placed in the shock chamber and received 10 footshocks (0.6 mA, 0.5-s duration, 
randomized intertrial interval with a range of 70-140 s and average of 90s). On two 
consecutive days after conditioning, mice were placed in the same shock chamber 
(shock grids exposed) for 15 min each day to measure calcium activity after fear 
conditioning. Calcium activity was recorded during the first 5 min of each trial. 
Calcium image processing. Calcium recording files were down-sampled (spatial 
binning factor of 2) to reduce processing time and file size, followed by correc- 
tion of rigid brain movement using the motion correction application (Inscopix, 
Mosaic v1.2). For analysis of recordings conducted in anaesthetized mice, indi- 
vidual component analysis and principle component analysis (ICA/PCA) applica- 
tions were used to identify individual cells and to extract their respective calcium 
traces. Because the microendoscope lens was implanted between the cerebellum 
and brainstem, this resulted in a slightly unstable lens and non-rigid motion that 
could not be corrected with recordings from awake mice. Consequently, we were 
unable to use ICA/PCA to extract traces from individual cells; we manually drew 
small regions of interest to extract calcium traces from cells. Raw traces were con- 
verted to AF/F (F — Foaseline average/ Foaseline average), Where F was the fluorescence 
at any given time-point and Fhaseline average Was the average fluorescence during a 
designated baseline period. Additional analyses were conducted using CNMF for 
microendoscopic data”!. 

CGRP?® TetTox studies. Chloroquine-induced scratching. Approximately one 
month after bilateral PBN injections of AAV1-DIO-GFP:TetTox or AAV1-DIO- 
GEP, mice were habituated to the experimental paradigm for two consecutive days. 
On the test day, mice were placed in a cage with a small amount of bedding for 
40 min, followed by a subcutaneous injection of chloroquine (10 il, 8mM in 0.9% 
saline) in the nape of the neck while scruffing the animal. After returning the 
animal to the cage, behaviour was videotaped for 20 min and scratching bouts 
were analysed. One or many consecutive scratches with any paw were counted as a 
single bout. Scratches interrupted by the animal licking the paw used for scratching 
or putting the paw back on the floor of the cage were counted as separate bouts. 
Adhesive sticker removal test. The same experimental cohort of animals as was used 
to examine chloroquine-induced scratching was also used to determine whether 
inhibition of CGRP?® neurons prevents sticker-induced scratching. One week 
after the itch study, mice were placed in a cage with a small amount of bedding for 
5 min before a red adhesive sticker (1/4 of a Shamrock brand dot label; #SDL-75-21 
dark red) was placed on the back of the animal (near the nape of the neck). After 
the animal was returned to the cage, behaviour was videotaped for 5 min. This 
manipulation induced scratching (similar to chloroquine injection) and behind- 


the-ear swipes with the front paw(s). A scratching bout or a behind-the-ear swipe 
was counted as a single removal attempt. 

Food neophobia. One group of Calca“"”’* mice received bilateral PBN injections 
of AAV1-DIO-GEP:TetTox to inhibit CGRP?®N neurons, whereas a second 
group received control injections of AAV1-DIO-GFP. Three weeks after surgery, 
including a one-week period of acclimation to handling, non-food-deprived mice 
were presented with a novel and palatable food pellet (45 kcal% high-fat diet) for 
20 min. This study was repeated three more times on consecutive days. Mice were 
not food deprived before the study and they had ad libitum access to standard chow 
and water during the experiment. 

Fear conditioning. Prior to fear conditioning or sham conditioning, mice were 
randomly assigned to the following groups: TetTox fear conditioning, GFP fear 
conditioning, TetTox sham conditioning, and GFP sham conditioning. Mice in 
fear conditioning groups underwent footshock fear conditioning, which involved 
10 tone pairings (3 kHz, 80 dB, 10-s duration) that co-terminated with a footshock 
(0.6 mA, 0.5-s duration, randomized intertrial interval with a range of 70-140s 
and average of 90s). Sham-conditioned control mice did not receive a footshock. 
After the conditioning phase, mice received bilateral PBN injections of AAV1-DIO- 
GFP:TetTox or AAV1-DIO-GFP (based on their assignment before conditioning). 
Three weeks later, mice underwent fear extinction training (as described in the cal- 
cium imaging study), followed by a second extinction training session the following 
day. We chose this conditioning paradigm (0.6 mA shock, 10 pairings compared 
to 0.3mA, 5 pairings in calcium imaging study) because the extinction training 
occurred three weeks after conditioning (as opposed to one day for the calcium 
imaging study) to allow for expression of the virus. Fear extinction training was 
video recorded and analysed by an investigator who was blind to the treatment. 
Freezing was defined as complete immobility, including absence of movement of 
the head and nose. 

Statistical analysis. Data were analysed using OriginPro 2016 (OriginLab). 
For calcium imaging studies, sample sizes were based on published studies”*”**. 
Neurons were considered activated if stimuli produced an increase in fluores- 
cent activity three s.d. above baseline fluorescent activity. For CGRP’®N neuron 
inhibition studies, post-hoc histological analysis showed bilateral transduction of 
CGRPP®N neurons with GEP:TetTox or GFP; therefore, all animals were included 
for final analyses. For graphs comparing multiple treatments (one time-point), we 
used repeated-measures one-way ANOVA tests, with Tukey’s post hoc multiple 
comparisons test. Data sets with multiple treatments and time-points were analysed 
with repeated-measures two-way ANOVA tests (time repeated factor), followed 
by Bonferroni's post hoc tests. Two-tailed Student's t-tests were used where 
appropriate. 

Data availability. The datasets generated during the current study are available 
from the corresponding author on reasonable request. 


41. Resendez, S. L. et al. Visualization of cortical, subcortical and deep brain neural 
circuit dynamics during naturalistic mammalian behavior with head-mounted 
microscopes and chronically implanted lenses. Nat. Protocols 11, 566-597 
(2016). 
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and electrical shock. a, b, Percentage of cells activated by tail pinch and anaesthetized mice; baseline was 2-s period before shock. AAV-DIO- 
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d, Average calcium fluorescence activity 1s before (Base) and during a neurons. Line graph is mean + s.e.m. (shaded region). ***P < 0.001, 
footshock (0.5 mA, 0.5s duration; n =5 mice, 317 neurons); bar graph paired Student's t-test (two-tailed). Related to Fig. 1. For statistical 
is mean +s.e.m.. e, Calcium activity time course of CGRP neurons analysis, see Supplementary Information. 


(n=3 mice, 158 neurons) and oxytocin receptor neurons (n = 3 mice, 
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Extended Data Figure 2 | CGRP? neurons encode stimulus intensity. 
a, Plot shows calcium fluorescence activity of neurons in an anaesthetized 
mouse following placement of the tail in a water bath ranging from 

40°C to 52°C (n= 56 neurons). b, Area under the curve of calcium 
activity following placement of tail in 40-52 °C water bath (n=5 mice, 
286 neurons). c, Percentage of cells activated by placement of tail in 56 

or 60°C water in anaesthetized mice; data points are individual mice 

(n=4 mice) and line represents mean. d, Time course and peak neuronal 
responses to tail being placed in 56 or 60°C water (n= 4 mice, 111 
neurons). Baseline was 10-s period before placing tail in water. e, Area 
under the curve of calcium activity following placement of tail in water 
bath. f, Time course and peak neuronal responses to tail receiving graded 


intensities of electrical current (2-6 mA, 2s duration; n =3 mice, 105 
neurons) in anaesthetized mice. Baseline was 2-s period before shock. 

g, Time course and peak calcium activity of neurons in an awake mouse 
(46 neurons) during exposure to a hotplate (40-56 °C, 20-s duration). 
Baseline was 2-min period in home cage before the hotplate test. h, Area 
under the curve of calcium activity following placement of awake mice 
on the hotplate (n =3 mice, 126 neurons). Bar graphs are mean + s.e.m.; 
line graphs are mean + s.e.m. (shaded region). ***P < 0.001; dissimilar 
letters above columns indicate statistical differences between treatments. 
b, h, One-way repeated measures ANOVA, Tukey’s post-hoc; e, paired 
Student’s t-test (two-tailed). Related to Fig. 1. For statistical analysis, see 
Supplementary Information. 
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Extended Data Figure 3 | Additional analysis of calcium imaging from 
fast-refeed study. a, Average calcium fluorescence activity 5s before and 
5s after presentation of standard chow pellet (1 =5 mice, 282 neurons); 
mean +s.e.m.. b, Calcium activity relative to taking a bite (n =5 mice, 
286 neurons). c, d, Plot showing calcium activity (normalized AF/F) 
relative to placement of chow in cage (c) and relative to bite (d). Plots show 
neurons (rows) from all experimental mice (n =5 mice, 282 neurons). 
e, f, Representative calcium traces extracted using CNMF analysis, relative 
to chow availability (e) and taking a bite (f). Baseline was 60-s period 
before chow availability for all data shown. ***P < 0.001; dissimilar letters 
above columns indicate statistical differences between time points; 
a, paired Student's t-test (two-tailed); c, one-way repeated measures 
ANOVA, Tukey’s post-hoc. Related to Fig. 3. For statistical analysis, 
see Supplementary Information. 
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activity in response to a novel, high-fat diet (HFD) pellet. Test trials were neurons; trial 2, n =5 mice, 244 neurons; trial 3, n =5 mice, 254 neurons. 
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neurons (rows) of all experimental mice. d—f, Average calcium activity analysis, see Supplementary Information. 


5s before and 5s after presentation of HFD pellet during three test trials. 
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values for neurons (rows) of all experimental mice. d-f, Average calcium statistical analysis, see Supplementary Information. 
activity 5s before and 10s after placing a marble in the cage. Graphs are 
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Extended Data Figure 7 | Lens placement over CGRP?®N neurons. 


Representative images of lens placement for imaging CGRP neurons 
(green). Dashed lines are approximations of lens placement. Scale bar 
is 100 ,1m. 
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Extensive impact of non-antibiotic drugs 
on human gut bacteria 


Lisa Maier!, Mihaela Pruteanu!}+*, Michael Kuhn2*, Georg Zeller2, Anja Telzerow!, Exene Erin Anderson!, Ana Rita Brochado!, 
Keith Conrad Fernandez!, Hitomi Dose’, Hirotada Mori’, Kiran Raosaheb Patil’, Peer Bork?*+>° & Athanasios Typas!* 


A few commonly used non- antibiotic drugs have recently been associated with changes in gut microbiome composition, 
but the extent of this phenomenon is unknown. Here, we screened more than 1,000 marketed drugs against 40 
representative gut bacterial strains, and found that 24% of the drugs with human targets, including members of all 
therapeutic classes, inhibited the growth of at least one strain in vitro. Particular classes, such as the chemically diverse 
antipsychotics, were overrepresented in this group. The effects of human-targeted drugs on gut bacteria are reflected 
on their antibiotic-like side effects in humans and are concordant with existing human cohort studies. Susceptibility to 
antibiotics and human-targeted drugs correlates across bacterial species, suggesting common resistance mechanisms, 
which we verified for some drugs. The potential risk of non-antibiotics promoting antibiotic resistance warrants further 
exploration. Our results provide a resource for future research on drug-microbiome interactions, opening new paths for 
side effect control and drug repurposing, and broadening our view of antibiotic resistance. 


Pharmaceutical agents have both beneficial and undesirable effects. 
Studies on the mechanisms of action and off-target spectra of vari- 
ous drugs aim to improve their efficacy and reduce their side effects. 
Although many drugs have gastrointestinal side effects and the gut 
microbiome itself is pivotal for human health!, the role of the gut micro- 
biota in these processes is rarely considered. Recently, consumption of 
drugs designed to target human cells and not microbes, such as anti- 
diabetics (metformin7), proton pump inhibitors (PPIs)**, nonsteroi- 
dal anti-inflammatory drugs” and atypical antipsychotics (AAPs)°, has 
been associated with changes in microbiome composition. A larger 
cohort study suggested that medication can alter gut microbiome com- 
position more generally’. As it is unclear whether such effects are direct 
and go beyond the few drug classes studied, we systematically profiled 
interactions between drugs and individual gut bacteria. We aimed to 
generate a comprehensive resource of drug actions on the microbiome, 
which could facilitate more in-depth clinical and mechanistic studies, 
ultimately improving therapy and drug design. 


A high-throughput drug screen on gut bacteria 

To systematically map interactions between drugs and human gut 
bacteria, we monitored the growth of 40 representative isolates upon 
treatment with 1,197 compounds in modified Gifu anaerobic medium 
(mGAM) broth, which partially recapitulates the species relative abun- 
dances in human gut microbiomes®, under anaerobic conditions at 
37°C (Extended Data Fig. 1a). We used the Prestwick Chemical Library, 
which consists mostly of off-patent Federal Drug Administration 
(FDA)-approved compounds with high chemical and pharmacologi- 
cal diversity. Most compounds are administered to humans (1,079), 
and they cover all main therapeutic classes (Supplementary Table 1). 
Three quarters (835) of the compounds are human-targeted drugs 
(that is, have molecular targets in human cells), whereas the rest are 
anti-infectives: 156 with antibacterial activity (144 antibiotics, 12 anti- 
septics) and 88 effective against fungi, viruses or parasites (Fig. la). 


All compounds were screened at 201M, which is within the range of 
what is commonly used in high-throughput drug screens’. 

For our screen to be representative of the gut microbiome of healthy 
individuals, we selected a set of ubiquitous gut bacterial species 
(Supplementary Table 2). Prevalence and abundance in the human gut, 
and phylogenetic diversity, were our main selection criteria (Extended 
Data Fig. 1b), although we were occasionally constrained by strain una- 
vailability or irreproducible growth in mGAM. In total, we included 40 
human gut isolates from 38 bacterial species and 21 genera (Escherichia 
coli and Bacteroides fragilis were represented by two strains each), 
accounting together for 78% of the median assignable relative abun- 
dance of the human gut microbiome at genus level (60% at species level; 
Extended Data Fig. 1c). Most strains were commensals, covering 31 of 
60 sequenced species detected at a relative abundance of 1% or more 
and prevalence of at least 50% in fecal samples from asymptomatic 
humans from three continents (Extended Data Fig. 1d). In addition, 
the set included four pathobionts (Clostridium difficile, Clostridium 
perfringens, Fusobacterium nucleatum and an enterotoxigenic strain 
of B. fragilis), a probiotic (Lactobacillus paracasei) and two commensal 
Clostridia species (C. ramosum and C. saccharolyticum). All 38 species 
are found in the gut of healthy individuals and are part of a larger strain 
resource panel for the healthy human gut microbiome’. 

We screened all compounds in multiwell plates, measuring optical 
density over time to monitor growth, and quantifying the area under 
the growth curve (AUC) up to the time point at which controls with 
unperturbed growth transitioned to stationary phase (see Methods; 
Extended Data Fig. 2). We obtained at least three biological repli- 
cates per strain, and these replicates correlated highly (Extended Data 
Fig. 2c). We then tested for significant deviations from the normalized 
AUC distribution of samples with unperturbed growth, combining 
P values across replicates and correcting for multiple hypothesis testing 
on the complete matrix of compounds and strains (see Methods; 
Extended Data Fig. 2). Drugs that significantly reduced the growth of 
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Figure 1 | Systematic profiling of marketed drugs on a representative 
panel of human gut microbial species. a, Broad impact of 
pharmaceuticals on the human gut microbiota. Compounds from the 
Prestwick Chemical Library are divided into drugs used in humans, drugs 
used exclusively in animals (vet) and compounds without medical or 
veterinary use (non-drugs). Human-use drugs are further categorized 
according to targeted organism. Strain-drug pairs (that is, instances in 
which a drug significantly reduced the growth of a specific strain; see 
Methods) are highlighted with a vertical coloured bar in the matrix. 
Bacterial strains are sorted by drug sensitivity. The relative abundances of 
each strain in four cohort studies of healthy individuals are displayed on 


at least one strain (false discovery rate (FDR) < 0.01), were classified as 
hits with anticommensal activity (Supplementary Table 3a), reflecting 
their potential to modulate the human gut microbiota. 

Of the 156 antibacterials tested, 78% were active against at least one 
species, typically with a broad activity spectrum (Fig. 1a, b). Inactive 
antibiotics belong mainly to the sulfonamides (which are inactive in 
our medium according to the manufacturer's guidelines), aminoglyco- 
sides (which have compromised activity under anaerobic conditions'”) 
and specific antimycobacterial drugs. Antibiotics are used to inhibit 
pathogens, but as expected, also target gut commensals. The medi- 
cal importance of this collateral damage to the resident microbiome 
has recently been becoming clearer!!. Nevertheless, to our knowledge, 
drug-microbiome species relationships have not previously been 
mapped at this scale. 

Notably, 27% of the non-antibiotic drugs were also active in 
our screen. More than half of the anti-infectives against viruses or 
eukaryotes exhibited anticommensal activity (47 drugs; Fig. 1a, b). 
Antibacterial activity has been previously reported for many of these 
drugs, including the antifungal imidazoles’? (11 in our screen), but 
not for others (for example, the antivirals saquinavir and trifluridine). 
More noteworthy is the anticommensal activity of 203 (24%) of the 
human-targeted drugs. Most were effective against only a few strains, 
with the exception of 40 drugs that affected at least 10 strains. Fourteen 
of these had, to our knowledge, not been previously reported to have 
direct antibacterial activity (Supplementary Table 3b). Among known 
human-targeted drugs with anticommensal activity, auranofin has 
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to median relative abundance). b, Fractions of drugs with anticommensal 
activity by sub-category. Grey scale within bars denotes inhibition 
spectrum (the number of affected strains per drug). c, Correlation between 
species abundance in the human microbiome and drug sensitivity. For 
each strain (n = 40), the number of drugs that affect its growth is plotted 
against its median relative abundance in the human gut microbiome. Lines 
depict the best linear fit, rs the Spearman correlation and grey shading 
the 95% confidence interval of the linear fit. All drugs, and in particular 
human-targeted drugs, inhibit the growth of more abundant species more 
than that of less abundant species. 


recently been reported to have broad-spectrum bactericidal activity’, 
and the ovulation stimulant clomiphene inhibits a conserved bacterial 
enzyme in the synthesis of an essential precursor for cell wall carbohy- 
drate polymers". Such drugs or their scaffolds can be used for repur- 
posing towards broad-spectrum antibiotics, especially as many have 
minimal inhibitory concentrations (MICs) in the sub-microgram per 
millilitre range (Supplementary Table 4). By contrast, the microbial 
narrow-spectrum specificity of most human-targeted drugs could aid 
the development of microbiome modulators. 

Bacterial species showed varied responses to drugs, with the abun- 
dant Roseburia intestinalis, Eubacterium rectale and Bacteroides vulgatus 
being the most sensitive, and -\-proteobacteria representatives being 
the most resistant (Fig. 1a). Overall, species with higher relative abun- 
dance across healthy individuals were significantly more susceptible 
to human-targeted drugs (P=0.0012 based on Spearman correlation; 
Fig. 1c). This suggests that human-targeted drugs have an even larger 
impact on the gut microbiome, with key species related to healthy 
status!°, such as major butyrate producers (E. rectale, R. intestinalis, 
Coprococcus comes) and propionate producers (B. vulgatus, Prevotella 
copri, Blautia obeum)"®, and enterotype drivers (P. copri)'’, being 
relatively more affected. 


Dose relevance and validation of the drug screen 

We sought to address how close the screening concentration (201M) 
was to drug concentrations in the terminal ileum and colon, where 
most gut microbes reside'’. However, drug concentrations are 
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Figure 2 | Evaluating human-targeted drugs with anticommensal 
activity. a, Estimated small intestine and colon concentrations and 
measured plasma concentrations of human-targeted drugs with (orange) 
and without (grey) anticommensal activity in our screen (see Methods; 
Extended Data Fig. 3). For both active and inactive compounds, the 
median estimated small intestine and colon concentrations are higher than 
the screened concentration (20 1M, black vertical lines), whereas plasma 
concentrations are lower. Non-hits in our screen generally reached higher 
plasma and small intestine concentrations (two-sided Wilcoxon rank sum 
test). Box plots: centre line, median; limits, upper and lower quartiles; 
whiskers, 1.5 x IQR; points, outliers. b, Rarefaction analysis indicates that 
anticommensal activity would be discovered for more human-targeted 
drugs if we screened additional strains. 


systematically measured only in blood; there, human-targeted drugs 
have on average an order of magnitude lower concentrations than in 
our screen (Fig. 2a, Extended Data Fig. 3). We deduced colon con- 
centrations on the basis of drug excretion patterns from published 
work, and small intestine concentrations on the basis of daily doses of 
individual drugs (Supplementary Table 1) and a measured example of 
duodenal concentrations for the well-absorbed drug posaconazole’® 
(see Methods). Based on these approximations, 201M was below the 
median small intestine and colon concentration of the human-targeted 
drugs tested here (Fig. 2a, Extended Data Fig. 3). Notably, human- 
targeted drugs that showed anticommensal activity had lower plasma 
and estimated small intestinal concentrations than ones with no such 
activity (Fig. 2a; P=0.0061 and P=0.0035, respectively, two-sided 
Wilcoxon rank sum test; we have fewer colon concentration estimates 
owing to data availability), suggesting that more human-targeted drugs 
would inhibit bacterial growth if probed at higher doses, closer to 
physiological concentrations. A case in point is metformin, which was 
recently identified as the key contributor to changes in the human gut 
microbiome composition of patients with type II diabetes’, but lacked 
anticommensal activity in our screen. Metformin reaches 10-40 .M 
in the plasma of treated patients with type II diabetes, but its small 
intestine concentration is 30-300-fold higher”®, which matches our 
estimates of small intestine and colon concentrations (1.5 mM). When 
we probed for higher, more physiological intestinal metformin concen- 
trations, 3 of 22 tested strains were inhibited at concentrations below 
1.5mM (Extended Data Fig. 4a). 

We also benchmarked our screen with an independent set of 
experiments, measuring IC); (the drug concentration conferring 25% 
growth inhibition) for 25 selected drugs in a subset of up to 27 strains 
(see Methods). This analysis revealed excellent precision (94%), but 
slightly lower recall (85%) (Extended Data Fig. 5a, b). False negatives, 
that is, drugs with anticommensal activity missed in our screen, were 
due to specific chemicals that probably lost activity during screening 
(Extended Data Fig. 5d), and our stringent FDR cutoff for calling hits. 
Increasing this cutoff to 0.1 would almost double the fraction of drugs 
with anticommensal activity (Extended Data Fig. 5c). In addition, 
we found that more species were inhibited at higher concentrations 
(Extended Data Fig. 5d, Supplementary Table 4), and that IC2; values 
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were mainly below the estimated gut concentrations and occasionally 
below plasma concentrations (Extended Data Fig. 6). 

Furthermore, we screened only a representative subset of species, 
but the gut microbiome of an individual harbours hundreds of species 
and an even larger strain diversity”!. Rarefaction analysis indicates that 
if more gut species were tested, the fraction of human-targeted drugs 
with anticommensal activity would increase (Fig. 2b). 

In summary, we probed human-targeted drugs largely within physio- 
logically relevant concentrations and our data are likely to under-report 
the impact of human-targeted drugs on gut bacteria. 


Concordance with patient data 

Having demonstrated that many human-targeted drugs inhibit gut 
bacteria in vitro at relevant doses, we searched for evidence that such 
effects manifest in vivo in the human gut. We reviewed all available 
clinical cohort data from metagenomics association studies and com- 
pared it to our screen if studies had enough statistical power and 
affected taxa that overlapped with those tested here. We found suit- 
able studies for PPIs, AAPs, and seven further drugs, spanning alto- 
gether five different drug classes according to Anatomical Therapeutic 
Chemical (ATC) classification. All three PPI representatives in our 
screen exhibited broad anticommensal activity, similar to the microbi- 
ome changes that have been reported in patients taking PPIs*“ (Fig. 3a): 
taxa with reduced abundance in patients exhibited reduced growth in 
our screen and taxa enriched in patients were rarely inhibited by PPIs 
in vitro (Extended Data Fig. 7a). This suggests that PPIs directly influ- 
ence the gut microbiome composition, in addition to changing the stom- 
ach pH and thus affecting which bacteria reach the gut**. Concordance 
was similarly high for many microbe—drug associations identified in a 
large Flemish cohort’ for the immunosuppressive agent azathioprine, 
the antidepressant venlafaxine, the anti-inflammatory mesalazine, 
aminosalicylate, progesterone, oestrogens and amoxicillin; the only 
exception was another antibiotic, nitrofurantoin (Extended Data 
Fig. 7b, c). We also compared our data to a study that reported a reduc- 
tion in Akkermansia levels in the gut of patients treated with AAPs®. 
Our screen included six of the ten AAPs investigated in that study. We 
found that Akkermansia muciniphila was more sensitive than other 
strains to these AAPs (P=0.09; two-sided Wilcoxon rank sum test), 
while being more resistant to other human-targeted drugs (P =0.0005, 
two-sided Wilcoxon rank sum test; Extended Data Fig. 7d). Finally, 
we found high concordance between a longitudinal microbiome study 
of patients taking metformin and our IC25 data for the same drug 
(Extended Data Fig. 4b). 

Metagenomics association studies and our in vitro study have 
distinct limitations. We screened a subset of species, mostly one strain 
per species, out of the context of microbial communities and the host. 
Cohort studies can be underpowered or biased by methodological 
approaches and confounding factors, and may detect indirect effects. 
Nonetheless, we find high concordance between the effects of drugs 
in vitro and in humans, confirming clinical relevance and direct anti- 
commensal activity for the aforementioned cases. 

To assess the physiological relevance of our screen further, we inves- 
tigated the registered side effects of these drugs in humans. We first 
identified side effects enriched in antibiotics for systemic use compared 
to those found in all other drugs in the SIDER database”. We iden- 
tified 69 side effects that were enriched in antibiotics (see Methods; 
Supplementary Table 5). These antibiotic-related side effects occurred 
more often in clinical trials of human-targeted drugs with anticommen- 
sal activity than in trials of compounds that were inactive in our screen 
(P=0.002, two-sided Wilcoxon rank sum test), whereas no significant 
difference was observed for placebo-treated patients (Fig. 3b). This 
suggests that the collateral damage of human-targeted drugs on gut 
bacteria can be detected by higher occurrences of antibiotic-like side 
effects in patients. 

We then tested whether this side effect signature predicted anti- 
commensal activity of human-targeted drugs, which we could have 
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Figure 3 | Anticommensal activity of human-targeted drugs in vitro 
reflects patient data. a, Changes in microbiome composition of patients 
taking PPIs are consistent with drug effects in our screen. Displayed are 
Spearman correlation coefficients between in vitro growth inhibition 

P values and changes in taxonomic relative abundance after PPI 
consumption for corresponding taxa from two studies (Twins UK* and 
Dutch? cohorts; 229 of 1,827 and 211 of 1,815 individuals had taken PPIs, 
respectively). The histogram represents the background distribution of 
correlations between the in vitro data for all human-targeted drugs and 
the in vivo response to PPIs; correlations with PPIs are highlighted by 
triangles. b, Human-targeted drugs with anticommensal activity in our 
screen had a significantly higher incidence of antibiotic-related side effects 
(orange trace shows cumulative distribution, n = 285 drug-side effect 
pairs) in clinical trials compared to drugs without activity (grey trace, 
n=767; P=0.002, two-sided Wilcoxon rank sum test). Dashed lines 
indicate the incidence of the same side effects upon placebo treatment, 
with no significant difference between active (n = 138) and inactive drugs 
(n= 474). c, Based on similarity to antibiotic-related side effects, we 
selected 26 candidate and 16 control drugs for testing for anticommensal 
activity. Although both candidate and control drugs inhibited bacterial 
growth at higher concentrations, candidate drugs had anticommensal 
activity at significantly lower doses than control drugs after normalizing 
for estimated intestine concentrations (P=5.6 x 107’, two-sided Wilcoxon 
rank sum test). Box plots as in Fig. 2a, n denotes number of drug-strain 
pairs. 


missed owing to the low drug concentration we used. We screened 26 
candidate compounds that showed enrichment of antibiotic-related 
side effects and 16 that did not (control compounds) for effects on the 
growth of 18 bacterial strains (Extended Data Fig. 8), in concentrations 
up to 2.5mM (Methods). Twenty-eight of these forty-two compounds 
inhibited the growth of at least one strain (Extended Data Fig. 8a-d), 
with both the fraction of active compounds and the number of affected 
strains being similar for both candidate and control compounds. 
However, when we normalized the measured IC); by the estimated 
intestine concentration (based on the recommended single drug dose) 
to make amounts comparable between drugs, a significant difference 
was evident. Drugs that were predicted to be active had a median IC); 
across all drug-strain pairs that corresponded to 4.3 drug doses, com- 
pared to 12 for control drugs (P=5.6 x 107’, two-sided Wilcoxon rank 
sum test; Fig. 3c). The IC; corresponds to less than two drug doses in 
34% of drugs with predicted activity, compared to just 8% for control 
drugs. Similarly, the IC25 is below the estimated colon concentration 
for 16/52 (31%) of candidate drug-strain pairs and for only 5/50 (10%) 
of control drug-strain pairs (Extended Data Fig. 8e). 
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In conclusion, human-targeted drugs with anticommensal activity 
have antibiotic-like side effects in humans, and for the few studies avail- 
able, consumption of these drugs led to changes in taxa we also detected 
to be inhibited in vitro, implying that more drugs with anticommensal 
activity reported here will have an impact in vivo. 


Features of drugs with anticommensal activity 

Drugs from all major ATC indication areas exhibited anticommensal 
activity, with antineoplastics, hormones and compounds that target 
the nervous system inhibiting gut bacteria more than other medica- 
tions (Extended Data Figs 9a, 10). Three ATC subclasses (antimetabo- 
lites, antipsychotics and calcium-channel blockers) were significantly 
enriched in hits (Extended Data Fig. 9a). Antimetabolites are used as 
chemotherapeutic and immunosuppressant agents, with their incorpo- 
ration into RNA or DNA, or their interaction with synthesis enzymes 
being cytotoxic to human cells. Their molecular targets are often con- 
served in bacteria”’, explaining the observed effects and raising the 
possibility that antibacterial effects may also be directly involved in the 
development of mucositis during chemotherapy”. 

The enrichment in antipsychotics is intriguing, given that they target 
dopamine and serotonin receptors in the brain, which are absent in 
bacteria. Although phenothiazines are known to have antibacterial 
effects”°, nearly all subclasses of the chemically diverse antipsychotics 
exhibited anticommensal activity (Extended Data Fig. 9b). These drugs 
targeted a significantly more similar pattern of species than expected 
from their chemical similarity (P= 2 x 107 ae permutation test; 
Extended Data Fig. 9c). This raises the possibility that direct bacterial 
inhibition may not only manifest as side effect of antipsychotics”, but 
also be part of their mechanism of action. 

As different ATC indication areas contain chemically similar drugs, 
we investigated whether the chemical properties of drugs can influence 
their anticommensal activity (Extended Data Fig. 11a). To some degree, 
chemically similar human-targeted drugs had more similar effects in 
the screen than less similar drugs (Extended Data Fig. 11b). We tested 
several compound properties, including complexity, molecular weight, 
topological polar surface area (TPSA), volume and hydrophobicity 
(XLogP). Complex, heavier and larger compounds preferentially tar- 
geted Gram-positive bacteria, whereas Gram-negative bacteria were 
protected against such bulkier drugs by their selective outer membrane 
barrier (Extended Data Fig. 12). Owing to the vast number of chemical 
moieties present in drugs with anticommensal activity, we did not 
attempt an exhaustive enrichment analysis. Nevertheless, we did 
observe reactive nitro-groups being enriched in drugs with anticom- 
mensal activity (P=6.4 x 10~°, Fisher’s exact test), indicating that local 
chemical properties may confer antibacterial activity. 


Human-targeted drugs may boost antibiotic resistance 
There is a strong correlation between resistance to antibacterials and 
resistance to human-targeted drugs in our data that cannot be explained 
simply by general cell envelope composition, as there is no clear divi- 
sion between Gram-positive and Gram-negative bacteria (Fig. 4a). We 
reasoned that more specific but common mechanisms could confer 
resistance to both drug groups. To test this hypothesis, we selected 
TolC, known to efflux several antibiotics in E. coli and other bacteria”’, 
as a prominent representative of a general resistance mechanism against 
antibiotics. We profiled an E. coli AtolC mutant and its parental wild 
type (BW25113) against the Prestwick Chemical Library. E. coli lacking 
TolC not only became more sensitive to antibacterials (22 hits more 
than wild type), but also became equally more sensitive to human- 
targeted drugs (19 additional hits; Fig. 4a, Supplementary Table 6). 
This effect is not specific to E. coli or TolC, as a more antibiotic- 
resistant B. uniformis strain (HM-715) was also equally more resistant 
to human-targeted drugs (Fig. 4a). 

While our data support a strong role for common general resist- 
ance mechanisms, there are also outliers to this trend, the most prom- 
inent being C. difficile and P. distansonis (Fig. 4a). For both, strong 
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Figure 4 | Antibiotic resistance mechanisms protect against human- 
targeted drugs. a, Susceptibility to antibacterial agents and human- 
targeted drugs correlates across the 40 tested strains (Spearman 
correlation, rs = 0.6 and a line depicting the nonlinear least-squares 
estimate of the odds ratio, OR= 0.06), suggesting common resistance 
mechanisms against both drug types. Knockout of a major antibiotic efflux 
pump (tolC) in the laboratory E. coli strain BW25113 (which behaves like 
the other two commensal E. coli strains in the screen) makes E. coli equally 
more sensitive to both antibacterials and human-targeted drugs. Two 
antibiotic-resistant isolates of B. fragilis (black square, HM-20) and 

B. uniformis (black diamond, HM-715) were screened in addition to the 
main screen, with only the latter showing a similar increase in resistance 
towards human-targeted drugs. b, Chemical genetic screen of an E. coli 


antibiotic resistance*® contrasted with relatively weaker resistance 
to human-targeted drugs. Similarly, an antibiotic-resistant B. fragilis 
strain, HM-20, was not equally resistant against human targeted drugs 
(Fig. 4a). These examples make the important distinction between 
specific antibiotic resistance mechanisms, which are irrelevant for 
resistance to human-targeted drugs, and more predominant, general 
mechanisms, which confer resistance to both drug groups. 

To elucidate mechanisms conferring resistance against human- 
targeted drugs more systematically, we used a chemical genetic 
approach”? and screened a genome-wide overexpression library in 
E. coli against seven non-antibiotics (six human-targeted drugs and 
niclosamide, an antiparasitic) that showed broad anticommensal 
activity in our screen. As wild-type E. coli was one of the most resist- 
ant gut species (Fig. 4a), we used the Afo/C mutant, which is sensitive 
to most of these drugs, allowing us to probe further resistance mech- 
anisms. For all tested drugs except metformin, overexpression of tolC 
rescued E. coli growth, as expected. Furthermore, we identified a num- 
ber of diverse transporter families that contributed to resistance against 
these drugs (Fig. 4b). Many of them have previously been linked to anti- 
biotic resistance**-**. Resistance was also acquired by overexpression 
of transcription factors (for example, rob, which controls efflux pump 
expression*4), the ribosome maturation factor rrmA, which plays a role 
in resistance to the antibiotic viomycin*>, and detoxification mech- 
anisms (nitroreductases modify nitro-containing antibiotics**). For 
methotrexate, we validated the known primary target in bacteria (E. coli 
dihydrofolate reductase)*”, illustrating the potential of this approach 
to identify bacterial mechanism of action of human-targeted drugs”. 

All of these results point to an overlap between resistance mecha- 
nisms against antibiotics and against human-targeted drugs, implying a 
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genome-wide overexpression library in seven non-antibiotics; all screens 
except for metformin were performed in AtolC background to sensitize 
E. coli to these drugs. Genes that when overexpressed significantly 
improved the growth of E. coli in the presence of at least one of the drugs 
are shown here; genes in bold have been previously associated with 
antibiotic resistance. Among them are genes encoding for transporters 
from different families: DMT (drug metabolite transporter), MFS (major 
facilitator superfamily), MATE (multidrug and toxin extrusion), SMR 
(small multidrug resistance) and ABC (ATP-binding cassette). Growth is 
measured by colony size (median n = 4)*°, colour depicts the normalized 
size difference from the median growth of all strains in the drug (more 
than sixfold difference), and dot size the significance (FDR-corrected 
P<0.1). Control denotes the growth of the library without drug. 


hitherto unnoticed risk of acquiring antibiotic resistance by consuming 
non-antibiotic drugs. 


Discussion 
We report a systematic drug screen against a reference panel of 
human gut bacteria. Twenty-seven per cent of non-antibiotics (24% 
of human-targeted drugs) inhibited the growth of at least one species. 
As we demonstrated, this is likely to be an underestimate owing to 
stringent thresholds for calling hits and the limited selection of bacterial 
strains screened. Many of the direct in vitro effects described here may 
translate into microbiome shifts in vivo, because (i) we used concen- 
trations within the range of what is estimated to be found in the human 
gut for many drugs; (ii) our observations agree with the few clinical 
microbiome studies for which medication has been recorded; and 
(iii) the side effects of anticommensal drugs in humans resemble those 
of antibiotics. Thus, our results underscore the necessity of accounting 
for potential medication-related confounding effects in future micro- 
biome disease association studies. Moreover, one could speculate that 
pharmaceuticals, used regularly in our times, may be contributing to a 
decrease in microbiome diversity in modern Western societies*®. 
Although the antibacterial potential of human-targeted drugs has 
been profiled repeatedly in the quest for new antimicrobials, previ- 
ous efforts have focused on pathogenic and often multi-drug-resistant 
(MDR) bacterial species”!*!4. We demonstrate that some of these 
species or their commensal relatives are the most drug-resistant in 
our screen (for example, \-proteobacteria: Bilophila wadsworthia and 
E. coli were affected by 2 and 4-7 human targeted drugs, respectively), 
that many human-targeted drugs have species-specific effects, and 
that resistance mechanisms to antibiotics and human-targeted drugs 
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partially overlap (thus, MDR species may be more resistant to human 
drugs too). Together, these findings explain why previous efforts have 
failed to register how many human-targeted drugs can inhibit bacteria. 

Many pharmaceuticals influence the human gut microbiota. As 
gut bacteria, in turn, can also modulate drug efficacy and toxicity? a 
the emerging drug-microbe network could guide therapy and drug 
development. The resource described here opens up new avenues for 
translational applications in mitigating drug side effects, improving 
drug efficacy, repurposing of human-targeted drugs as antibacterials 
or microbiome modulators, and controlling antibiotic resistance (see 
Supplementary Discussion). However, before any translational appli- 
cation can be pursued, our in vitro findings need to be tested rigorously 
in vivo (in animal models, pharmacokinetic studies and clinical trials) 
and understood better mechanistically. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 
Bacterial strains and growth conditions. Bacterial isolates used in this study were 
purchased from DSMZ, BEI Resources, ATCC and Dupont Health & Nutrition, or 
were gifts from the Denamur Laboratory (INSERM). All strains were recovered in 
their recommended rich medium (resource and literature). The screen and valida- 
tion experiments were performed in mGAM (HyServe GmbH & Co.KG, Germany, 
produced by Nissui Pharmaceuticals)*!, as almost all species could grow robustly 
in this medium in a manner that is reflective of their gut abundance®. Because we 
selected for robust growth, potential positive effects of drugs on growth could not 
be detected. Only one strain was grown in Todd-Hewitt Broth (Sigma-Aldrich), 
one in a 1:1 mixture of mGAM and gut microbiota medium” and, for one strain, 
mGAM was supplemented with 60 mM sodium formate and 10 mM taurine (see 
also Supplementary Table 2). All media were pre-reduced at least 1 day before use 
under anoxic conditions in an anaerobic chamber (Coy Laboratory Products Inc.) 
(2% Hy, 12% COs, rest N2) and all experiments were performed under anaerobic 
conditions at 37°C unless specified otherwise. No statistical methods were used 
to predetermine sample size. 
Species selection. To select a representative core of species in the human gut 
microbiome, we analysed 364 fecal metagenomes from asymptomatic individuals 
from three continents** “®. Species were defined and their abundance quantified as 
previously described*”**. A core set of 60 microbiome species was defined 
(Extended Data Fig. 1b-d), and from this core, 31 species were selected for this 
screen. Seven additional species were selected for reasons explained in the main text. 
Screen of the Prestwick Chemical Library. Preparation of screening plates. The 
Prestwick Chemical Library was purchased from Prestwick Chemical Inc. with 
compounds coming dissolved in dimethyl sulfoxide (DMSO) at a concentration 
of 10mM. Compounds were re-arrayed to redistribute the DMSO control wells in 
each plate and to minimize the total number of 96- and 384-well plates (4 x 384- 
well plates or 14 x 96-well plates). At the same time, drugs were diluted to a con- 
centration of 2mM to facilitate further aliquoting, and these plates were stored at 
—30°C. For each experimental batch (10 replicates in 96-well plates; 20 replicates 
in 384-well plates), we prepared drug plates in the respective growth medium 
(2 for 96-well plates, 1 x for 384-well plates), and stored them at —30°C until use 
(maximum 2 months). Before inoculation, plates were thawed and pre-reduced in 
the anaerobic chamber overnight. The Biomek FXP (Beckman Coulter) liquid han- 
dling system was used for all rearranging and aliquoting of the library compounds. 
Inoculation. Strains were grown twice overnight to make sure we had a robustly 
and uniformly growing culture before inoculating the screening plates. For 96-well 
plates, the second overnight culture was diluted to fresh medium in order to reach 
2x the desired starting optical density (OD) at 578 nm. Next, 5011 of this diluted 
inoculum was added to wells containing 50 1l of 2x concentrated drug in the 
respective culture medium using a multichannel pipetter. The final drug concentra- 
tion was 201M and each well contained 1% DMSO. We inoculated 384-well plates 
with a 384 floating pin replicator VP384FP6S (V&P Scientific, Inc.), transferring 
11 of appropriately diluted overnight culture to wells containing 50 il of growth 
medium, 1% DMSO and 20\M drug. For bacterial species that reached lower OD 
in overnight cultures we transferred twice 1 11 of appropriately adjusted OD culture. 
For both 96- and 384-well plates, the starting OD was 0.01 or 0.05, depending on 
the growth preference of the species (Supplementary Table 2). 
Screening conditions. After inoculation, plates were sealed with breathable membranes 
(Breathe-Easy) to prevent evaporation and cross-contamination between wells, and 
incubated at 37°C without shaking. Growth curves were acquired by tracking OD 
at 578 nm with a microplate spectrophotometer (EON, Biotek). Measurements were 
taken every 1-3h after 30-605 of linear shaking, initially manually but later automat- 
ically using a microplate stacker (Biostack 4, Biotek), fitted inside a custom-made 
incubator (EMBL Mechanical Workshop). We collected measurements for 16-24h. 
Each strain was screened in at least three biological replicates. 
Normalization of growth curves and quantification of growth. Growth curves were 
analysed by plate. All growth curves within a plate were truncated at the time of 
transition from exponential to stationary phase. The end of the exponential phase 
was determined automatically by finding the peak OD (using the median across 
all compounds and control wells, and accounting for a small increase during the 
stationary phase) and verified by inspection. Using this time point allowed us to 
capture the effects of drugs on lag phase, growth rate and stationary phase plateau 
(Extended Data Fig. 2a). Time points with sudden spikes in OD (for example, 
caused by condensation) were removed, and growth curves were discarded com- 
pletely if they had too many missing time points (Extended Data Fig. 2a). Similarly, 
growth curves were discarded if the OD fell far outside the normal range (for 
example, caused by coloured compounds). Three compounds had to be completely 
excluded from the analysis, as they caused aberrant growth curves: Chicago sky 
blue 6B, mitoxantrone and verteporfin. 

Growth curves were processed by plate to set the median OD at the start and 
end time points to 0 and 1, respectively. Then we defined reference compounds 
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across all replicates as those that did not reduce growth significantly for most drugs, 
had measurements for >95% of all replicates, and for which the final OD was 
>0.5 for more than 142 out of 152 replicates. We used these reference compounds 
as representatives of uninhibited growth. As wells containing reference compounds 
outnumbered control wells within a plate, we used control wells only later to 
verify the P value calculation (Extended Data Fig. 2d). After identifying reference 
compounds, we rescaled growth curves such that the median growth of reference 
compounds at the end point was 1. 

While growth curves in control wells and most wells with reference compounds 
followed the expected logistic growth pattern, a variety of deviations were observed 
for drugs that influenced growth. To quantify growth without relying on assump- 
tions about the shape of the growth curve, we calculated the area under the curve 
(AUC) using the trapezoidal rule. Although we set the median starting OD to 0, the 
ODs of individual wells deviated from this. We used two methods to correct for this 
and determine the baseline for each growth curve (Extended Data Fig. 2a). First, a 
constant shift was assumed, subtracting the same shift from all time points of the 
growth curve such that the minimum is zero. Second, an initial perturbation was 
assumed that affects initial time points more than later time points (for example, 
condensation). To correct this, we first subtracted a constant shift as above, and 
then rescaled the curve such that a time point with an uncorrected OD of 1 also 
had an OD of 1 after correction. AUCs were calculated for both scenarios, rescaled 
such that the AUC of reference compounds was 1, and then for each compound 
the baseline correction that yielded an AUC closest to 1 (that is, normal growth) 
was selected. 

AUCs are highly correlated to final ODs, with a Pearson correlation of 0.95 

across all compounds and replicates. Nonetheless, we preferred to use AUCs to 
decrease the influence of the final time point, which will contain more noise than 
a metric based on all time points. 
Identification of drugs with anticommensal activity. We detected hits from nor- 
malized AUC measurements using a statistical method that controls for multiple 
hypothesis testing and varying data quality. We fitted heavy-tailed distributions 
(scaled Student’s t-distribution*’) to the wells containing reference compounds 
for each replicate and, separately, to each individual plate. These distributions 
captured the range of AUCs expected for compounds that did not reduce growth, 
and represented the null hypothesis that a given drug did not cause a growth 
defect in the given replicate or plate. We calculated one-sided P values from the 
cumulative distribution function of the fitted distribution. Within a replicate, each 
compound was associated with two P values: one from the plate on which it was 
measured, and one for the whole replicate. Of those two, the highest P value was 
chosen (conservative estimate) to control for plates with little or high noise, and 
varying levels of noise within the same replicate. 

The resulting P values were well-calibrated (that is, the distribution of P values 
was close to uniform with the exception of a peak at low P values, Extended Data 
Fig. 2d) and captured the distribution of controls, which were not used for fitting the 
distribution and kept for validation. We then combined P values for a given drug and 
strain across replicates using Fisher’s method. Lastly, we calculated the FDR using 
the Benjamini-Hochberg method” over the complete matrix of P values (1,197 
compounds by 40 strains). After inspecting representative AUCs for compound- 
strain pairs at different FDR levels, we chose a conservative FDR cut-off of 0.01. 
Drug indications, dose, and administration. We annotated drugs by their primary 
target organism on the basis of their WHO ATC classification, or, if there were 
uncertainties, based on manual annotation. Compounds were classified as: antibac- 
terial drugs (antibiotics, antiseptics), anti-infective drugs (acting against protozoa, 
fungi, parasites or viruses), human-targeted drugs (that is, drugs whose mechanism 
of action affects human cells), veterinary drugs (used exclusively in animals), and 
finally non-drugs (which can be drug metabolites, drugs used only in research, or 
endogenous substances). If a human-use drug belonged to several classes, the drug 
class was picked according to this order of priority (from high to low): antibacterial, 
anti-infective, and human-targeted drug. This ensured that drugs used also as 
antibacterials were not classified in the other two categories. 

Drugs from the Prestwick Chemical Library were matched against STITCH 
4 identifiers*! using CART. Identifiers that could not be mapped were anno- 
tated manually. Information about drug indications, dose and administration was 
extracted from the ATC classification system and Defined Daily Dose (DDD) 
database. Dose and administration data were also extracted from the Drugs@ 
FDA resource. Doses that were given in grams were converted to mol using the 
molecular weight stated in the Prestwick library information files. When the dose 
guidelines mentioned salt forms, we manually substituted the molecular weight. 
Dose data from Drugs@FDA stated the amount of drug for a single dose (for 
example, a single tablet). Analysing the intersection between Drugs@FDA and 
DDD, we found that the median ratio between the single and daily doses was two. 
To combine the two data sets we therefore estimated the single dose as half of the 
daily dose (Supplementary Table 1). 
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In general, it is difficult to estimate effective drug concentrations in the intestine, 

as those depend on the dose, the speed of dissolution, uptake and metabolization by 
human cells and by bacteria, binding to proteins, and excretion mechanisms into 
the gut. To estimate gut concentrations of drugs based on their dose with a simple 
model, we relied on an in situ study for posaconazole!’. When 40 mg (57 umol) of 
the drug is delivered to the stomach in either an acidic or a neutral solution, the 
maximum concentration in the duodenum reaches 26.3 + 10.3 or 13.6+5.8\.M, 
respectively. This is equivalent to dissolving the drug in 300 ml (240 ml of water to 
swallow the pill as recommended for bioavailability/bioequivalence studies plus 
~43 ml resting water in the small intestine*’) and an absorption rate of 90%. We 
collected doses for as many human-targeted drugs as we could find and used the 
above assumption to estimate small intestine concentrations. To estimate colon 
concentrations, we relied on reported fecal excretion data (Supplementary Table 1, 
gathered from DrugBank 5.0° and across the literature) assuming a single 
daily dose, 24h transit time” and a volume of distribution in the colon of 0.6 1°° 
(Extended Data Fig. 3). 
IC,; determination and screen validation. To validate our screen, we selected 
25 drugs including human-targeted drugs (19), antiprotozoals (3), one antipar- 
asitic, one antiviral and one ‘non-drug’ compound. The human-targeted drugs 
spanned five therapeutic classes (ATC codes A, G, L, M and N). Our selection com- 
prised mostly drugs with extended antibacterial activity in our screen (19 drugs 
hit >10 strains). This bias ensured that we could also evaluate false positives. We 
chose 15 strains to test ICs (that is the minimal concentration of drug that causes 
25% growth inhibition), spanning different phyla (5) and including both sensitive 
(E. rectale, R. intestinalis) and resistant species (E. coli ED1a). 

Compounds for validation were purchased from independent sources 
(Supplementary Table 1) and dissolved at 100 x starting concentration in DMSO. 
Twofold serial dilutions were prepared in 96-well U-bottom plates (as for the 
screen). Each row contained a different drug at eleven twofold dilutions and a 
control DMSO well in the middle of the row (in total eight drugs per plate). These 
master plates were diluted to 2 assay concentration and 2% DMSO in mGAM 
(50,11) and stored at —30°C (<1 month). For the assay, plates were pre-reduced 
overnight in the anaerobic chamber, and mixed with an equal volume (5011) 
of appropriately diluted overnight culture (prepared as described for screening 
section) to reach a starting ODs7s of 0.01 and a DMSO concentration of 1% across 
all wells. ODs7g was measured hourly for 24h after 1 min of shaking. Experiments 
were performed in two biological replicates. 

Growth curves were converted to AUCs as described above, using in-plate 

control wells (no drug) to define normal growth. For each concentration, we cal- 
culated the mean across the two replicates. We further enforced monotonicity to 
conservatively remove noise effects: if the AUC decreased for lower concentrations, 
it was set to the highest AUC measured at higher concentrations. The IC); was 
defined as the lowest concentration for which a mean AUC of below 0.75 was 
measured. In 68% of cases, IC25s were equal between replicates and in a further 
22%, there was a twofold change between replicates, which is within the twofold 
error margin reported for inhibitory concentrations”’. Additionally, MIC as listed 
in Supplementary Table 4 was defined as the lowest concentration for which the 
AUC dropped below 0.1. In the large-scale screen, we detected significant growth 
reductions, which do not necessarily correspond to complete growth inhibition 
(Extended Data Fig. 2b). To ensure comparability between the results of the val- 
idation procedure and the screen, we used the IC; metric for benchmarking. As 
inhibitory concentration calculations are known to have a twofold error margin*”, 
we considered an IC); of 10-40 1M as being in agreement with the screening result 
(Extended Data Fig. 5a, b). A higher number of false negatives implies that more 
human-targeted drugs are likely to have anticommensal activity. 
Analysis of side effects. Side effects of drugs were extracted from the SIDER 
4.1 database” using the mapping between Prestwick compounds and STITCH 4 
identifiers described above. In SIDER, side effects are encoded using the MedDRA 
terminology, which contains lower-level terms and preferred terms. Of these, we 
used the preferred terms, which are more general. We excluded rare side effects 
that occurred for fewer than five drugs from the analysis. Drugs with fewer than 
seven associated side effects were discarded™. In a first pass, we identified side 
effects associated with antibiotics in SIDER, by calculating for each side effect its 
enrichment for systemic antibiotics (ATC code J01) versus all other drugs using 
Fisher's exact test (P value cut-off: 0.05, correcting for multiple hypothesis testing 
using the Benjamini-Hochberg method). Antibiotics are typically administered 
in relatively high doses, and some of the enriched side effects might therefore 
be caused by a dose-dependent effect (for example, kidney toxicity). We there- 
fore used an ANOVA (type II) to test whether the presence of side effects for a 
drug was more strongly associated with it being an antibiotic or with its (log- 
transformed) dose. Side effects that were more strongly associated with the dose 
were excluded from the list of antibiotic-related side effects. 


Data on the incidence rates of side effects in patients was also extracted from 
SIDER 4.1. As different clinical trials can report different incidence rates, we com- 
puted the median incidence rate per drug-side effect pair. As SIDER also contains 
data on the incidence of side effects upon placebo treatment, we were able to ensure 
the absence of systematic biases. 

Experimental validation of side effect-based predictions. Selected candidate and 
control compounds belonged to multiple therapeutic classes (ATC codes A, B, 
C, G, H, L, M,N, S for candidate compounds and A, C, D, G, H, MN, R, S, V for 
control compounds). Compounds of interest were purchased from independent 
sources (Supplementary Table 1) and if possible, dissolved at 5 mM concentra- 
tion in mGAM. Lower concentrations were used when the solubility limit was 
reached. Solutions were sterile filtered, and three fourfold serial dilutions were 
arranged in 96-well plates, aiming at covering a broad range of drug concentrations. 
Inoculation and growth curve acquisition was performed as described for the IC25 
determination experiments. 

Chemical genetics in E. coli. Conjugation of the TransBac overexpression plasmid 
library into E. coli AtolC. The TransBac library, a new E. coli overexpression 
library based on a single-copy vector®® (H.D. and H.M., unpublished resource) 
was conjugated in the BW25113 AtolC::Kan strain. The receiver strain (BW25113 
AtolC::kan) was grown to stationary phase in LB medium, diluted to an ODs7g of 1, 
and 20011 was spread on an LB plate supplemented with 0.3 mM diaminopimelic 
acid (DAP). Plates were dried for 1h at 37°C and then a 1536 colony array of the 
library carried within a donor strain (BW38029 Hfr (CIP8 oriT::cat) dap’ ©) was 
pinned on top of the lawn. Conjugation was carried out at 37°C for ~6h, and the 
first selection was done by pinning on LB plates supplemented with tetracycline 
only (10j1g/ml) and growing overnight. Two more rounds of selection followed 
on LB plates containing both tetracycline (101g/ml) and kanamycin (30 1g/ml) 
to ensure killing of parental strains and select only for tolC mutants carrying the 
different plasmids. 

Chemical genetic screen. The screen was carried out under aerobic conditions on 
solid LB Lennox medium (Difco), supplemented with 30,1g/ml kanamycin, 101g/ml 
tetracycline, the appropriate drug, and 0 or 100|1M IPTG. Drugs were used at 
the following sub-inhibitory concentrations for the tolC mutant: diacerein 201M, 
ethopropazine hydrochloride 160,1M, tamoxifen citrate 20|1M, niclosamide 1.25 1M, 
thioridazine hydrochloride 401M, methotrexate 320 1M, or for the wild type: 
metformin 100 mM. The 1536 colony array of BW25113 AtolC::kan mutant car- 
rying the TransBac collection was pinned on the drug-containing plates, and plates 
were incubated for 16-38 h at 37°C. In the case of metformin we used the version 
of the TransBac library in which each plasmid complements its corresponding 
barcoded single-gene deletion mutant®, since we did not need to use the AtolC 
background to sensitize the cell. Growth of this library was determined at 0 and 
100 mM metformin (both in the presence of 0, 50 and 100|1M IPTG). All plates 
were imaged using an 18-megapixel Canon Rebel T3i and images were processed 
using the Iris software””. 

Data analysis. We used colony size to measure the fitness of the mutants on the 
plate. For standardization of colony sizes, we subtracted the median colony size 
and then divided by a robust estimate of the s.d. (removing outliers below the 
1st and above the 99th percentile). We found edge effects affecting up to five 
rows and columns around the perimeter of the plate. We therefore first stand- 
ardized colony sizes across the whole plate using only colony sizes from the 
inner part of the plate as reference. To remove the edge effects, we subtracted 
from each column its median colony size, and then from each row its median 
colony size. Finally, we standardized the adjusted colony sizes using the whole 
plate as reference. The distribution of adjusted colony sizes was right-skewed 
(that is, more outlier colonies with larger sizes), suggesting a log-normal distri- 
bution. At the same time, the presence of outliers suggested that a logarithmic 
equivalent of the Student's t-distribution with variable degree of freedom” 
would be more suitable. We fitted such a distribution for each plate and cal- 
culated P values for both tails of the distribution. This approach assumes that 
the overexpression of most genes does not affect growth in response to drug 
treatment. P values were combined using Fisher’s method across replicates and 
IPTG concentrations (since we noticed that different IPTG concentrations 
resulted in largely the same results—that is, plasmids are leaky). We corrected 
for multiple hypothesis testing for each drug individually using the Benjamini- 
Hochberg method”. 

Analysis of common resistance mechanisms. To determine a relationship between 
the number of human-targeted drugs (h) and the number of antibacterial drugs 
(a) that affect each strain, we determined the odds ratio (OR): 


OR= H-h 
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where H= 203 and A = 122 are the numbers of human-targeted and antibacterial 
drugs that show activity against any strain, respectively. We computed the nonlinear 
least-squares estimate for OR using the following equation: 


mee OR x — 

H-h A-a 
Data availability. Data are available from FigShare: http://dx.doi.org/10.6084/ 
m9.figshare.4813882. All data generated during this study are included in this 
published article and its Supplementary Information files. 
Code availability. Scripts for analysing data and generating figures are available at 
https://git-embl.de/mkuhn/drug_impact_gut_bacteria. A snapshot of the reposi- 
tory has been deposited together with the data. 
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Extended Data Figure 1 | See next page for caption. 


Extended Data Figure 1 | Screen set-up and species selection. a, Drugs 
from the Prestwick Chemical Library (arranged in either 96- or 384-well 
format) were diluted in growth medium (usually mGAM) and pre-reduced 
in a Coy anaerobic chamber before inoculation with one of forty different 
human gut microbes. Bacterial growth was monitored for 16-24h at 37°C. 
Growth curves were acquired at least in triplicate for each drug—microbe 
interaction (see Methods). b, Species with a minimum relative abundance 
of 1% in at least one sample and a prevalence of 50% across samples 

(the latter estimated by rarefying to 10,000 reads mapping to taxonomic 
markers) were included in the set of core species. Boxplots show relative 
abundances of core species grouped by genus (according to NCBI 
taxonomy) and coloured by phylum (see key). The inner box indicates 

the IQR, with the median as black vertical line; the outer bars extend to 
the 5th and 95th percentiles; circles, outliers. To the right of the boxplots, 
prevalence is depicted by bars, and next to this the species diversity is 
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shown; grey boxes indicate species represented in the screen with box 
widths corresponding to mean relative abundance within the genus. 

c, Relative abundance of genera of which at least one species was 
represented in the screen cumulates to 78% of the assignable fraction of 
reads (median across all samples, upper panel); first four boxplots show 
abundance within each study identified by country codes underneath 
(DK: Denmark; ES: Spain; US: United States; CN: China) *~4°. When 
directly cumulating the relative abundance of represented species the 
corresponding median is 60% (lower panel). Boxes span the IQR and 
whiskers extend to the most extreme data points up to a maximum 

of 1.5 times the IQR. d, Core species are shown in the order of their 
median abundances across all samples. Relative abundance boxplots and 
prevalence bars are defined as in b and grey boxes underneath indicate 
species screened in this study. Numbers in brackets correspond to specl 
cluster identifiers (version 1)*”. 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | Data analysis pipeline for identifying 
compounds with anticommensal activity. a, Schematic overview of the 
data analysis pipeline. All steps (determination of time cutoff and removal 
of noisy points; normalization and selection of reference compounds; 
baseline correction and AUC calculation; and hit calling) are explained in 
detail in the Methods. On the first panel, dashed curves in the righthand 
plot depict the three possible effects that a drug can have on the growth 

of a microbe: increase the lag phase, decrease the growth rate or the 
stationary phase plateau. All effects are captured by cutting off the growth 
curves upon transition to stationary phase for most compounds (most 
drugs do not affect growth). On second panel, median growth rates for 
two drugs on same plate are depicted and normalized, whereas baseline 
correction (third panel) is applied at the individual wells. b, Growth curves 
(top, normalized OD) of Bacteroides ovatus in three exemplary drug cases 
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for the three biological replicates (meclofenamic acid (red), moricizine 
(green) and diacerein (blue)). Light and dark grey shades represent 

the 50% and 90% confidence intervals for normal growth, respectively. 
Bottom, normalized AUC histograms for all drugs in the three biological 
replicates for B. ovatus. Meclofenamic acid is just below the hit threshold, 
moricizine is a hit with partial but strong growth inhibition, and diacerein 
almost completely inhibits the growth of B. ovatus. c, For most species, 
correlation between replicates is very high (median: 0.88). d, For both 
controls and reference compounds, P values were approximately uniformly 
distributed. Determining the background distribution of uninhibited 
growth using reference compounds is validated by their very similar 
behaviour with control wells. Other drugs (that is, drugs not used as 
reference compounds) show clear enrichment of low P values. 
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Extended Data Figure 3 | Anticommensal activity relative to 
compound- and compartment-specific drug concentrations. We made 
a simplified pharmacokinetic estimation of small intestine and colon 
concentrations by assuming that one dose of an orally administered drug 
(extracted from Drugs@FDA and Daily Defined Dose (DDD) of the ATC) 
reaches the intestine and is dissolved or absorbed similarly to the well- 
absorbed drug posaconazole’® (Supplementary Table 1). After absorption 
into the liver via the portal circulation, the drug enters circulation through 
the hepatic veins and reaches its characteristic plasma concentration. The 
two main routes of drug elimination are either secretion via kidneys and 
urine or secretion into the intestine via the biliary duct. In the intestine, 
drugs can be reabsorbed in a circuit called the enterohepatic cycle or 
excreted in stools. Compounds that are either poorly absorbed in the 
small intestine or secreted by bile reach the large intestine. Considering 
the measured excreted fraction of the drug in faeces (both changed and 
unchanged compound, as we do not know whether drug is metabolized 
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in liver or gut), and assuming a large intestinal transit time of 24h* and 

a volume of distribution in the colon of 0.6 1°°, we estimated the colon 
concentrations of the human-targeted drugs in our screen (Supplementary 
Table 1). Histograms for drug dose, plasma concentration, estimated small 
intestine concentration, urinary and fecal excretion and estimated colon 
concentration depict the respective distributions for human-targeted 
drugs, colour coded according to their anticommensal behaviour in our 
screen. Dashed lines indicate medians and vertical lines highlight the drug 
concentration used in our screen (20M). Interactions between drugs 

and microbiota are possible throughout the entire gastrointestinal tract, 
with microbial load having a gradient-like distribution (ileum and colon 
containing the largest numbers); this can be disturbed during disease'®. 

In addition, drugs can be modified at several stages: by host digestive and 
intestinal epithelial enzymes, by phase I and phase II metabolism in the 
liver and by microbial enzymes. Some of these processes neutralize each 
other, resulting in reconversion into the original compound. 
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and recall (85%). We considered IC; as the lowest concentration that 
reduces growth by at least 25% (see Methods). Breakdown into active and 
inactive compounds for drugs concentrations at the 20|1M concentration, 
used in our screen. True positives (TP), green; false positives (FP), red; 
true negatives (TN), grey; false negatives (FN), blue. c, Number of drugs 
with anticommensal activity versus the applied FDR threshold for all 
compounds (left) and human-targeted drugs (right). Increasing the FDR 
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fraction of drugs that affect human gut microbes. d, IC5s of 25 drugs in 
up to 27 individual strains (see also a, b). The white area indicates the drug 
concentration range tested for each drug. Symbol sizes depict the number 
of strains with a particular IC25, symbol colours indicate categorization 
into false negative, false positive, true negative and true positive, and 
symbol shapes qualify whether actual IC25s were determined or IC25 was 
deemed to be higher or lower than the highest or lowest concentration 
tested, respectively. Vertical line indicates the drug concentration used in 
screen (201M). IC)ss for all drug-—strain pairs are listed in Supplementary 
Table 4. Particular drugs were responsible for false negatives in our screen 
(acarbose, loperamide, thioridazine), presumably owing to drug decay. 


of Springer Nature. All rights reserved. 


ARTICLE 


Auranofin 

Niclosamide 

Nifurtimox 

5-fluorouracil 
Streptozotocin 
Pyrimethamine 

Dienestrol 

Tamoxifen citrate 
Diethylstilbestrol 
roglitazone 

Thioridazine hydrochloride 
Mefloquine hydrochloride 
Zidovudine, AZT 
Clomiphene citrate (Z,E) 
Ethopropazine hydrochloride 
Estradiol Valerate 
Loxapine succinate 
Rabeprazole Sodium salt 
Omeprazole 
Lansoprazole 

Acarbose 

Loperamide hydrochloride 


Small intestine 


[o) 
— 
(2) 
ye) 
[o) 


30 


Plasma Colon 
pa Es 
ea 
ai 
EBA 
=== ai 
ES | 
Say 
Es 
ie] 
a 
j= iz 
= 
= ia 
fal & 
|_| ini] 
Ea = 
| 
0 10 20 30 (6) 10 20 30 


Count of affected strains per drug 


1Cz5 not determined ey IC25 above body concentration i IC25 below body concentration 


Extended Data Figure 6 | IC25 relation to drug concentrations in human 
body. For drug-strain pairs with measured IC)ss (see also Extended Data 
Fig. 5), we compared IC;5s with plasma and estimated small intestine and 
colon concentrations by plotting the number of strains that are affected in 
relation to whether they are above or below relevant body concentrations 


(colour code). With the exception of oestradiol valerate and 5-FU 

(only plasma concentrations available), all other drugs with available body 
concentrations reach concentrations high enough in the body to reach 
their IC; for at least one gut microbial species (out of up to 27 species 
tested for IC)ss). 
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Extended Data Figure 7 | Concordance of drug in vitro species 
susceptibilities and drug-mediated shifts in microbiome composition 
of patients. a, Association coefficients between PPI usage and relative 
taxonomical abundance in faecal microbiomes of PPI users from two 
studies (twins, UK cohort, green’; three independent cohorts from the 
Netherlands’, blue) (left) are compared to in vitro growth inhibition of 
isolates with same taxonomic rank in the presence of PPIs (omeprazole, 
lansoprazole and rabeprazole) as assessed by FDR-adjusted P values 

(q values) in our screen (right). Point size in the left panel corresponds 
to the q value as reported in the original study. Taxa that were reduced in 
patients (negative association coefficient, left of vertical black line) were 
mostly inhibited by PPIs in our screen (q value below 0.01, left of vertical 
black line), whereas enriched taxa were insensitive to PPIs. Box plots show: 
centre line, median; box limits, upper and lower quartiles; whiskers, 

1.5 IQR; points, outliers. For fewer than 10 data points, all points 

are shown individually. b, Spearman correlation coefficients between 
association coefficients of faecal microbiome composition after 
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The histogram represents the background distribution of correlations 
between the in vitro data for all human-targeted drugs and the in vivo 
response to these drugs; correlations with amoxicillin or azathioprine are 
highlighted by triangles c, Comparisons between association coefficients 
and drugs from different therapeutic classes as assessed by Falony 

et al.’ and our in vitro data. d, A study of a cohort of patients with bipolar 
disease® reported a significant decrease in abundance of Akkermansia 
upon treatment with atypical antipsychotics (AAP). When we compared 
distributions of adjusted P values from our screen for different strains, 
Akkermansia muciniphila was significantly more sensitive than all other 
strains to antipsychotics in general and APP in particular (P= 0.02 and 
P=0.09, two-sided Wilcoxon rank sum test). By contrast, A. muciniphila 
is relatively more resistant than other strains across all human-targeted 
drugs (P= 0.0005, two-sided Wilcoxon rank sum test). Violin plot shows 
estimated density of points with the estimated median as vertical bar. For 
fewer than 10 data points, all points are also shown individually. 
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Extended Data Figure 8 | Evaluation of anticommensal activity 
predictions based on side-effects. a, IC)5s of 26 candidate compounds 

(P value for enrichment of antibiotic-related side effects <1 x 1074, using a 
one-sided Fisher's exact test) and 16 control compounds (see also d of this 
Figure) were determined for 18 representative strains; results are depicted 
as an IC); heatmap. Drugs are ordered according to their similarity in side 
effects to antibiotics from left to right (for antibiotic-related side effects 
see Supplementary Table 5). Qualifiers indicate whether ICs are higher 
or lower than the indicated concentration; if no symbol, the box depicts 
the exact ICs. If highest tested concentrations did not reduce growth 

of any of the tested strains, the compound was classified as inactive (for 
example, Topiramate). b, Dose of the tested compounds according to the 
Defined Daily Dose and Drugs@FDA databases (see also Supplementary 
Table 1). c, Based on a compound’s recommended dose and its median 
IC; for different bacterial strains, we estimated the number of doses need 
to reach this IC25. This number was plotted against the drug’s P value 

for enrichment of antibiotic-related side effects. For direct comparison 
between the two groups, see Fig. 3c. Circles in magenta depict drug-strain 
pairs for which growth was reduced, showing a clear correlation between 
P values and the estimated number of doses (magenta line). To rule out the 


possibility that the tested concentration range is causing this correlation, 
we also depict the estimated number of doses corresponding to the highest 
tested concentration (grey line), which exhibits no clear dependency 
between P value and number of doses. A vertical line across all panels 
connects all parameters attributable to a particular drug. d, Recommended 
single drug doses of human-targeted drugs with no anticommensal 
activity in our screen plotted against enrichment in antibiotic-related side 
effects (n = 339). Candidate and control drugs selection for testing for 
anticommensal activity at higher concentrations were selected on the basis 
of similarity to antibiotic-related side effects (vertical black line depicts 
prediction threshold) and aiming at drugs used at higher doses than 
concentration in our screen (horizontal dashed line). Purple and dark grey 
triangles indicate hits and non-hits from this validation effort, respectively. 
e, Ratios between IC); and estimated colon concentrations are significantly 
lower (P= 0.017, two-sided Wilcoxon rank sum test) for candidate drugs 
than for control drugs. For candidate drugs, 16 of 52 (31%) IC25s were 
below the estimated colon concentrations while for control drugs this 
fraction was only 5 of 50 (10%). Box plots show: centre line, median; box 
limits, upper and lower quartiles; whiskers, 1.5 IQR; points, outliers. 
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Extended Data Figure 9 | Drug therapeutic classes with anticommensal 
activity. a, Fraction of drugs with anticommensal activity by ATC 
indication area (bars). All first-level indication areas and significantly 
enriched lower levels are shown (see also Extended Data Fig. 10). 
Significance (P value, one-sided Fischer’s exact test) is controlled for 
multiple hypothesis testing (Benjamini-Hochberg) independently at each 
ATC hierarchy level. b, Heat map of anticommensal activity and chemical 
similarities of human-targeted drugs within the three significantly ATC 
indication levels from a (indicated by circled numbers). Colours represent 
the median of drug pairwise Spearman correlations within and between 
subgroups depicted, calculated from the growth profiles of the 40 strains 
in each drug (P values) or their chemical similarity (Tanimoto scores”). 
Examples of structurally similar (phenothiazines; NOSAA-AC) and diverse 
(NOSAF-AX) antipsychotics that elicit similar responses in our screen 

are marked. c, Antipsychotics exhibit higher similarity in gut microbes 
they target than that expected on the basis of their structural similarity 
(P=2 x 10)’ estimated from random permutations; other classes 
depicted show no significance difference). Box plots show: centre line, 
median; box limits, upper and lower quartiles; whiskers, 1.5 IQR; points, 
outliers. Notches correspond roughly to a 95% confidence interval for 
comparing medians. 
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Extended Data Figure 10 | Drugs with anticommensal activity for all 
hierarchy levels of the ATC classification system. Fraction of drugs with 
anticommensal activity for all indication areas of the ATC classification 
scheme with a high fraction of active compounds. Shown are indication 
areas that contain at least two active compounds and a fraction of at least 
50% active compounds, their parent terms and all top-level indication 
areas. Significance (P value, one-sided Fischer's exact test) is indicated by 


1e-05 


te-03 


te-01 


the bar colour and corrected for multiple hypothesis testing (Benjamini- 
Hochberg) independently at each hierarchy level of the ATC. Many smaller 
classes, including PPIs (A02BC), non-selective calcium channel blockers 
(CO8E), synthetic oestrogens (G03CB), leukotriene receptor antagonists 
(RO3DC) and phenothiazine and other antihistamines (RO6AD and 
RO6AX) are enriched, but owing to multiple testing and the small numbers 
of drugs tested in each group, they do not reach a significant P value. 
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Extended Data Figure 11 | Comparing chemical similarity of drugs and have similar effects on the 40 strains—for example, phenothiazine 
and similarity of hit profiles across gut microbes. a, Heat map of antipsychotics and antihistamines (NO5A and RO6AD), structurally 
anticommensal activity and chemical similarities for all active human- similar dibenzothiazepines and dibenzoxazepines for antipsychotics and 
targeted drugs in our screen. Drugs are clustered according to chemical antidepressants (NO5AH and NO6AA), PPIs (A02BC), anti-oestrogens 
similarity. Colours represent the median of drug pairwise Spearman (LO2BA), synthetic oestrogens (G03CB) and anti-inflammatory fenamates 
correlations within and between subgroups depicted, calculated from (M01AG and M02AA06). b, A mild correlation exists between chemical 
the growth profiles of the 40 strains in each drug (P values) or their similarity (Tanimoto scores) and anticommensal activity similarity (drug 
chemical similarity (Tanimoto scores®™). Several prominent groups are pairwise Spearman correlations): r; = 0.12 (P value of Spearman's test 
colour coded. Only drugs of some classes both share chemical similarity <2 x 107'). 
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A galaxy lacking dark matter 


Pieter van Dokkum!, Shany Danieli!, Yotam Cohen!, Allison Merritt!*, Aaron J. Romanowsky**, Roberto Abraham>, 
Jean Brodie’, Charlie Conroy®, Deborah Lokhorst?, Lamiya Mowla!, Ewan O’Sullivan® & Jielai Zhang” 


Studies of galaxy surveys in the context of the cold dark matter 
paradigm have shown that the mass of the dark matter halo and 
the total stellar mass are coupled through a function that varies 
smoothly with mass. Their average ratio Mpato/Mstars has a minimum 
of about 30 for galaxies with stellar masses near that of the Milky 
Way (approximately 5 x 10'° solar masses) and increases both 
towards lower masses and towards higher masses!”. The scatter 
in this relation is not well known; it is generally thought to be less 
than a factor of two for massive galaxies but much larger for dwarf 
galaxies**. Here we report the radial velocities of ten luminous 
globular-cluster-like objects in the ultra-diffuse galaxy> NGC1052- 
DF2, which has a stellar mass of approximately 2 x 10° solar masses. 
We infer that its velocity dispersion is less than 10.5 kilometres per 
second with 90 per cent confidence, and we determine from this 
that its total mass within a radius of 7.6 kiloparsecs is less than 
3.4 X 108 solar masses. This implies that the ratio Mnato/Metars is of 
order unity (and consistent with zero), a factor of at least 400 lower 
than expected”. NGC1052-DF2 demonstrates that dark matter is not 
always coupled with baryonic matter on galactic scales. 

NGC1052-DF? was identified with the Dragonfly Telephoto Array® 
in deep, wide-field imaging observations of the NGC 1052 group. The 
galaxy is not a new discovery; it was catalogued previously in a visual 
search of digitized photographic plates’. It stood out to us because of 
the remarkable contrast between its appearance in Dragonfly images 
and Sloan Digital Sky Survey (SDSS) data: with Dragonfly it is a low- 
surface-brightness object with some substructure and a spatial extent 
of about 2’, whereas in SDSS it appears as a collection of point-like 
sources. Intrigued by the likelihood that these compact sources are 
associated with the low-surface-brightness object, we obtained 
follow-up spectroscopic observations of NGC1052-DF2 using the 
10-m W. M. Keck Observatory. We also observed the galaxy with the 
Hubble Space Telescope (HST). 

A colour image generated from the HST Véo¢ and Igi4 data is shown 
in Fig. 1. The galaxy has a striking appearance. In terms of its apparent 
size and surface brightness, it resembles dwarf spheroidal galaxies such 
as those recently identified® in the M101 group at 7 Mpc, but the fact 
that it is only marginally resolved implies that it is at a much greater 
distance. Using the Ig,, band image, we derived a surface-brightness- 
fluctuation distance of Dspp = 19.0 + 1.7 Mpc (see Methods). It is 
located only 14’ from the luminous elliptical galaxy NGC 1052, 
which has distance measurements ranging from 19.4 Mpc to 21.4 Mpc 
(refs 9, 10). We infer that NGC1052-DF? is associated with NGC 1052, 
and we adopt D = 20 Mpc for the galaxy. 

We parameterized the galaxy’s structure with a two-dimensional 
Sérsic profile’. The Sérsic index is n = 0.6, the axis ratio is b/a =0.85, 
the central surface brightness is ju( V606,0) = 24.4 mag arcsec”, and 
the effective radius along the major axis is R.= 22.6”, or 2.2 kpc. We 
conclude that NGC1052-DF2 falls in the ‘ultra-diffuse galaxy’ (UDG) 
class®, which have R. > 1.5 kpc and u(g, 0) > 24 magarcsec*. In terms 
of its structural parameters, it is very similar to the galaxy Dragonfly 17 


in the Coma cluster®. The total magnitude of NGC1052-DF2 is Mens = 
—15.4, and the total luminosity is Ly = 1.1 x 10° solar luminosities, Lo. 
Its colour Veo¢ — Igi4 = 0.37 £0.05 in the AB magnitude system, similar 
to that of other UDGs and metal-poor globular clusters'”. The stellar 
mass was determined in two ways: by placing a stellar population at 
D=20 Mpc that matches the global properties of NGC1052-DF2 (see 
Methods), and by assuming M/Ly =2.0 as found for globular clusters’. 
Both methods give Mgtars*2 x 10° solar masses, Mo. 

We obtained spectroscopy of objects in the NGC1052-DF? field with 
the W. M. Keck Observatory. Details of the observations and data 
reduction are given in the Methods section. We found ten objects with 
a radial velocity close to 1,800 kms! (all other objects are Milky Way 
stars or background galaxies). We conclude that there is indeed a popu- 
lation of compact, luminous objects associated with NGC1052-DF2. 
Their spectra near the strongest calcium triplet lines are shown in 
Fig. 2. The mean velocity of the ten objects is (v)=1,803'5kms~!. The 


Figure 1 | HST/Advanced Camera for Surveys (ACS) image of 
NGC1052-DF2. NGC1052-DF2 was identified as a large (approximately 2’) 
low-surface-brightness object, at right ascension a =2h 41 min 46.85, 
declination 6= —8° 24’ 12” (J2000). HST imaging of NGC1052-DF2 was 
obtained on 2016 November 10, using the ACS. The exposure time was 2,180s 
in the Véo¢ filter and 2,320s in the Ig14 filter. The image spans 3.2’ x 3.2’, or 

18.6 kpc x 18.6 kpc at the distance of NGC1052-DF2; north is up and east is to 
the left. Faint striping is caused by imperfect charge-transfer-efficiency removal. 
Ten spectroscopically confirmed luminous compact objects are marked. 
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Figure 2 | Spectra of the compact objects. The square panels show the 
HST/ACS images of the ten confirmed compact objects. Each panel spans 
1.7” x 1.7", or 165 pe x 165 pc at the distance of NGC1052-DF2. The Keck 
spectra are shown next to the corresponding HST images. The regions 
around the reddest A8,544.4, \8,664.5 Ca 11 triplet lines are shown, as 
illustrated in the model spectrum at the top; the A8,500.4 Ca triplet line 
was included in the fit but falls on a sky line for the radial velocity of these 
objects. The spectra were obtained with the Low Resolution Imaging 


NGC 1052 group has a radial velocity of 1,425 km s |, witha lo spread 
of only +111kms™' (based on 21 galaxies). NGC1052-DF2 has a pecu- 
liar velocity of +378kms_! (3.4¢) with respect to the group and + 
293kms_! with respect to NGC 1052 itself (Fig. 3). 

Images of the compact objects are shown in Fig. 2, and their loca- 
tions are marked on Fig. 1. Their spatial distribution is somewhat 
more extended than that of the smooth galaxy light: their half-number 
radius is Rg. 3.1 kpc (compared with effective radius R, =2.2kpc for 
the light) and the outermost object is at Rout=7.6 kpc. In this respect, 
and in their compact morphologies (they are just-resolved in our HST 
images, as expected for their distance) and colours, they are similar to 
globular clusters and we will refer to them as such. Their luminosities 
are, however, much higher than those of typical globular clusters. The 
brightest (GC-73) has an absolute magnitude of Mgog = —10.1, similar 
to that of the brightest globular cluster in the Milky Way (w Centauri). 
Furthermore, the galaxy has no statistically significant population of 
globular clusters near the canonical peak of the luminosity function at 
My = —7.5. The properties of these enigmatic objects are the subject of 
another paper (P.v.D. et al., manuscript in preparation). 

The central observational result of the present study is the remarka- 
bly small spread among the velocities of the ten clusters (Fig. 3). The 
observed velocity dispersion is 7.,;=8-4kms~!, as measured with the 
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Spectrometer (LRIS), the Deep Imaging Multi-Object Spectrograph 
(DEIMOS) or both. The spectral resolution is instr ¥30kms~!. 
Uncertainties in the spectra are in grey. The signal-to-noise ratio ranges 
from 3.4 per pixel to 12.8 per pixel, with 0.4 A pixels. The red lines show 
the best-fitting models. Radial velocities cz are indicated, as well as the 
velocity offset with respect to the central (v) = 1,803 kms~!. This velocity is 
indicated with dashed vertical lines. 


biweight estimator (see Methods). This value is much smaller than that 
in previously studied (cluster) UDGs'*"* and not much higher than the 
expectation from observational errors alone. Taking the errors into 
account, we find an intrinsic dispersion of Ointr 39139 kms~!. The 
90% confidence upper limit is Jintr << 10.5km s_!. Within the Local 
Group, typical galaxies with velocity dispersions in this range are small 
(R.& 200 pc) and have a low stellar mass (Metars © (2-3) x 10°M.)}°. 
The average velocity dispersion of Local Group galaxies with 
8.0 <log(M/Ma) < 8.6 is 32 kms~! (dotted curve in Fig. 3a). 

We calculate the corresponding 90% confidence upper limit on 
the mass of NGC1052-DF2 using the tracer mass estimator (TME) 
method!®, which provides an estimate of the dynamical mass, May 
within the radius of the outermost discrete tracer, Roy. We find 
Mayn < 3.4 X 10°M.5 within Rout= 7.6 kpc. We also determine the 
dynamical mass within the projected half-number radius of the globular 
cluster system’ and find Mayn < 3.2 X 108M, within Ry. =3.1 kpe. 

In Fig. 4a, the enclosed mass is compared to the expected mass from 
the stars alone (orange line) and to models with different halo masses. 
The dynamical mass is consistent with the stellar mass, leaving little 
room for dark matter. The best fit to the kinematics is obtained for 
Mhalo = 0, and the 90% confidence upper limit on the dark matter halo 
mass is Mpalo < 1.5 x 10°M. We note that the combination of the large 
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Figure 3 | Velocity dispersion. The filled grey histograms show the 
velocity distribution of the ten compact objects. a, Wide velocity range 
including the velocities of all 21 galaxies in the NASA/IPAC Extragalactic 
Database with cz < 2,500kms7! that are within a projected distance of two 
degrees from NGC 1052. The red dotted curve shows a Gaussian with a 
width of o=32kms |, the average velocity dispersion of Local Group 
galaxies with 8.0 < log(Motars/Ma) < 8.6. b, Narrow velocity range centred 
on cz=1,803kms~!. The red solid curve is a Gaussian with a width that is 
equal to the biweight dispersion of the velocity distribution of the compact 
objects, Top; = 8.4kms~!. Taking observational errors into account, 

we derive an intrinsic dispersion of dint: = a7? kms7!, with the 
uncertainties 1 s.d. The 90% confidence upper limit on the intrinsic 
dispersion is Cintr<10.5kms_!. 


spatial extent and low dynamical mass of NGC1052-DF2 yields an 
unusually robust constraint on the total halo mass. Typically, kinematic 
tracers are only available out to a small fraction of the virial radius, 
and a large extrapolation is required to convert the measured enclosed 
mass to a total halo mass*. However, for a halo of mass Moo) + 10°M qs, 
the virial radius is only about 10 kpc, similar to the radius at which the 
outermost globular clusters reside. As shown in Fig. 4b, a galaxy with 
a stellar mass of Mstars=2 X 10°M,q is expected to have a halo mass of 
Mhalo 6 X 10!°M 5, a factor of about 400 higher than the upper limit 
that we derive. We conclude that NGC1052-DF2 is extremely deficient 
in dark matter, and a good candidate for a ‘baryonic galaxy’ with no 
dark matter at all. 

It is unknown how the galaxy was formed. One possibility is that it 
is an old tidal dwarf, formed from gas that was flung out of merging 
galaxies. Its location near an elliptical galaxy and its high peculiar 
velocity are consistent with this idea. Its relatively blue colour suggests 
a lower metallicity than might be expected for such objects!’, but that 
depends on the detailed circumstances of its formation'®. An alternative 
explanation is that the galaxy formed from low-metallicity gas that was 
swept up in quasar winds”’. The lack of dark matter, the location near 
a massive elliptical, the peculiar velocity and the colour are all quali- 
tatively consistent with this scenario, although it is not clear whether 
the large size and low surface brightness of NGC1052-DF2 could have 
been produced by this process. A third option is that the galaxy formed 
from inflowing gas that fragmented before reaching NGC 1052, either 
relatively close to the assembling galaxy”! or out in the halo””. This 
fragmentation may have been aided or precipitated by jet-induced 
shocks”. In any scenario, the luminous globular-cluster-like objects 
require an explanation; generically, it seems likely that the three pecu- 
liar aspects of the galaxy (its large size, its low dark matter content and 
its population of luminous compact objects) are related. An important 
missing piece of information is the number density of galaxies such as 
NGC1052-DF2. There are several other objects in our Cycle 24 HST 
programme that look broadly similar, but these do not have dynamical 
measurements yet—and the fact that other UDGs have anomalously 
high, rather than low, dark matter fractions!*!4 demonstrates that such 
data are needed to interpret these galaxies. 

Regardless of the formation history of NGC1052-DF2, its existence 
has implications for the dark matter paradigm. Our results demonstrate 
that dark matter is separable from galaxies, which is (under certain 
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Figure 4 | Constraints on the halo mass. a, Enclosed mass profiles for 
Navarro—Frenk—White haloes” of masses M299 = 10°Mq, 10°Mo, 10!°Ms 
and 10''Mz (grey lines). The 10°M,z halo profile is shown by a dotted line 
beyond R= Ry99 © 10 kpc. The orange curve is the enclosed mass profile 
for the stellar component, and the black curves are the total mass profiles 
Mtotal = Mstars + Mhalo- The ten globular clusters are at distances ranging 
from R=0.4kpc to R=7.6 kpc; short vertical bars on the horizontal axis 
indicate the locations of individual clusters. The 90% upper limits on the 
total enclosed mass of NGC1052-DF2 are shown by arrows. The limit 

at R=7.6 kpc was determined with the TME method"®. The arrow at 
R=3.1 kpc is the mass limit within the half-number radius of the globular 
cluster system!”. The dynamical mass of NGC1052-DF2 is consistent 
with the stellar mass and leaves little room for a dark matter halo. b, The 
upper limit on the halo mass and comparison to the expected dark matter 
mass from studies that model the halo mass function and the evolution 

of galaxies”*°. Grey solid symbols are nearby dwarf galaxies with rotation 
curves extending to at least two disk scale lengths*. Open squares are three 
cluster UDGs with measured kinematics: VCC 1287", Dragonfly 44’? and 
DFX1'?. NGC1052-DF?2 falls a factor of at least 400 below the canonical 
relations. 


circumstances) expected if it is bound to baryons through nothing 
but gravity. The ‘bullet cluster demonstrates that dark matter does not 
always trace the bulk of the baryonic mass”, which in clusters is in the 
form of gas. NGC1052-DF2 enables us to make the complementary 
point that dark matter does not always coincide with galaxies either: 
it is a distinct ‘substance’ that may or may not be present in a galaxy. 
Furthermore, and paradoxically, the existence of NGC1052-DF2 
may falsify alternatives to dark matter. In theories such as modified 
Newtonian dynamics (MOND)* and the recently proposed emergent 
gravity paradigm”®, a ‘dark matter’ signature should always be detected, 
as it is an unavoidable consequence of the presence of ordinary matter. In 
fact, ithad been argued previously” that the apparent absence of galaxies 
such as NGC1052-DF2 constituted a falsification of the standard 
cosmological model and offered evidence for modified gravity. For a 
MOND acceleration scale of ay =3.7 x 10°km*s *kpc7!, the expected”8 
velocity dispersion of NGC1052—-DF2 is oy © (0.05GM tars) 4/4 & 
20kms~!, where G is the gravitational constant—a factor of two higher 
than the 90% upper limit on the observed dispersion. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Imaging. In this paper, we use imaging from the Dragonfly Telephoto Array, the 
Sloan Digital Sky Survey, the MMT, the Gemini North telescope and the Hubble 
Space Telescope. 

Dragonfly. The Dragonfly Telephoto Array® data were obtained in the context of the 
Dragonfly Nearby Galaxy Survey*". Dragonfly was operating with eight telephoto 
lenses at the time of the observations, forming the optical equivalent of an f/1.0 
refractor with a 40-cm aperture. The data reach a 1o surface brightness limit of 
ju(g) © 29 magarcsec in 12" x 12” boxes*!. The full Dragonfly field is shown in 
Extended Data Fig. 1, as well as the area around NGC 1052 and NGC1052-DF2. 
SDSS. SDSS images in the g, r and i bands were obtained from the DR14 Sky 
Server®’. To generate the image in Extended Data Fig. 4, the data in the three bands 
were summed without weighting or scaling. The object is located near the corner of 
a frame. We note that the SDSS photometry for the compact objects is not reliable, 
as it is ‘contaminated’ by the low surface brightness emission of the galaxy (which 
is just detected in SDSS). 

MMT. MMT/Megacam imaging of the NGC 1052 field was available from a project 
to image the globular cluster systems of nearby early-type galaxies**. The data were 
taken in the rand i bands, in 0.9” seeing. They were used for target selection in our 
first Keck spectroscopic run. 

Gemini. We obtained imaging with the Gemini-North Multi Object Spectrograph™ 
(GMOS) in programme GN-2016B-DD-3. The observations were made on 2016 
October 10, with total exposure times of 3,000 s in the g band and 3,000s in the 
iband. The seeing was 0.65” in i and 0.70" in g. The data were reduced using the 
Gemini IRAF package. Low-order polynomials were fitted to individual (dithered) 
300-s frames after carefully masking objects, to reduce large-scale background 
gradients at low surface brightness levels. The images were used to aid in the target 
selection and mask design for the LRIS spectroscopy. The Gemini data also provide 
the best available information on the regularity of the galaxy at low surface bright- 
ness levels (see below). The combined frames still show some background artefacts, 
but they are less prominent than those in the HST data. Finally, a visual inspection 
of the Gemini images prompted us to request a change in the scheduling of HST 
program GO-14644, moving the already-planned ACS observation of NGC1052- 
DF2 to an earlier date. 

HST. The HST data were obtained as part of programme GO-14644. The aim of 
this programme is to obtain ACS images of a sample of 23 low-surface-brightness 
objects that were identified in fields of the Dragonfly Nearby Galaxy Survey*". 
NGC1052-DF2 was observed on 2016 November 16, for a total of two orbits. 
Exposure times were 2,180s in Veos and 2,3205 in Jgi4. In this paper we use images 
processed by the drizzle (variable-pixel linear reconstruction) method, which have 
been corrected for charge-transfer efficiency (CTE) effects. Despite this correction, 
some CTE artefacts are still visible in the data (see Fig. 1). 

Structural parameters. The size, surface brightness and other structural 
parameters of NGC1052-DF2 were determined from the HST data. First, the Igi4 
image was rebinned to a lower resolution to increase the signal-to-noise (S/N) ratio 
per pixel. A preliminary object mask was then created from a segmentation map 
produced by SExtractor*, using a relatively high detection threshold. A first-pass 
Sérsic model* for the galaxy was obtained using the GALFIT software’'. This 
model was subtracted from the data, and an improved object mask was created 
using a lower SExtractor detection threshold. Finally, GALFIT was run again to 
obtain the final structural parameters and total magnitude. The total magnitude in 
the Véo6 band, and the Vos — Igi4 colour, were determined by running GALFIT on 
the (binned and masked) Véo¢ image with all parameters except the total magnitude 
fixed to the Igi4 values. The structural parameters, total magnitude and colour 
are listed in the main text. We note that we measured nearly identical structural 
parameters from the Gemini images. 

Spectroscopy. We obtained spectroscopy of compact objects in the NGC1052-DF2 
field in two observing runs. The first set of observations was obtained on 2016 
September 28 with DEIMOS* on Keck II, and the second was obtained on 2016 
October 26 and 27 with LRIS** on Keck I. 

DEIMOS observations. Conditions were variable, with cirrus clouds increasing 
throughout the night. We obtained 4h of total on-source exposure time on a single 
multi-object mask; a second mask was exposed but yielded no useful data owing 
to clouds. The target selection algorithm gave priority to compact objects with 
i< 22.5 near NGC1052-DF2, selected from the MMT data. We used the 1,200 
lines mm’ grating with a slit width of 0.75", providing an instrumental resolution 
of Oinstr-¥ 25 kms~!. The data were reduced with the same pipeline that we used 
previously’? for the Coma UDGs Dragonfly 44 and DFX1. The globular clusters 
GC-39, GC-71, GC-73, GC-77, GC- 85, GC-92 and GC-98 (see Figs 1 and 2) were 
included in this mask. 

LRIS observations. Two multi-slit masks were observed, one for 3.5h (on source) 
on October 26 and a second for 4h on October 27. Targets were selected from 
the Gemini data, giving priority to compact objects that had not been observed 
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with DEIMOS. Conditions were fair during both nights, with intermittent cirrus 
and seeing of about 1”. We used the gold-coated grating with 1,200 lines mm! 
blazed at 9,000 A. The instrumental resolution instr is about 30kms~!. A custom 
pipeline was used for the data reduction, modelled on the one that we developed 
for DEIMOS. LRIS suffers from considerable flexure, and the main difference 
from the DEIMOS pipeline is that each individual 1,800-s exposure was reduced 
and calibrated independently to avoid smoothing of the combined spectra in the 
wavelength direction. The clusters GC-39, GC-59, GC-73, GC-91 and GC-101 
were included in the first mask; GC-39, GC-71, GC-77, GC-85 and GC-92 were 
included in the second. 

Combined spectra. Most compact objects were observed multiple times, and we 
combined these individual spectra to increase the S/N ratio. All spectra were given 
equal weight, and before combining they were divided by a low-order polynomial 
fit to the continuum in the calcium triplet (CaT) region. We tested that weight- 
ing by the formal S/N ratio instead does not change the best-fit velocities. The 
individual spectra were also shifted in wavelength to account for the heliocentric 
correction; this needs to be done at this stage as the DEIMOS and LRIS data were 
taken one month apart. Six GCs have at least two independent observations and 
effective exposure times of about 8h; four were observed only once: GC-59, GC-91, 
GC-98 and GC-101. The S/N ratio in the final spectra ranges from 3-4 per pixel for 
GC-59, GC-98 and GC-101 to 13 per pixel for GC-73. A pixel is 0.4A or 14kms~!. 
Velocity measurements. Radial velocities were determined for all objects with 
detected CaT absorption lines. No fits were attempted for background galaxies 
(based on the detection of redshifted emission lines), Milky Way stars or spectra 
with no visible features. The measurements were performed by fitting a template 
spectrum to the observations, using the emcee MCMC algorithm”. The template 
is a high-resolution model of a stellar population*', smoothed to the instrumental 
resolution. The model has an age of 11 Gyr and a metallicity of [Fe/H] =—1, which 
is consistent with the colours of the compact objects (Véos — Ig14¥ 0.35); the results 
are independent of the precise choice of template. The fits are performed over the 
observed wavelength range 8,530 A < A<8,750A and have three free parameters: 
the radial velocity, the normalization and an additive term that serves as a template 
mismatch parameter (as it allows the strength of the absorption lines to vary with 
respect to the continuum). The fit was performed twice. After the first pass, all 
pixels that deviate by more than 3c were masked in the second fit. This step reduces 
the effect of systematic sky-subtraction residuals on the fit. 

The uncertainties given by the emcee method do not take systematic errors 
into account. Following previous studies*’, we determined the uncertainties in 
the velocities by shuffling the residuals. For each spectrum, the best-fitting model 
was subtracted from the data. Next, 500 realizations of the data were created by 
randomizing the wavelengths of the residual spectra and then adding the shuffled 
residuals to the best-fitting model. These 500 spectra were then fitted by using a 
simple \? minimization, and the 16th and 84th percentiles of the resulting velocity 
distribution yield the error bars. To preserve the higher noise at the location of 
sky lines, the randomization was done separately for pixels at the locations of sky 
lines and for pixels in between the lines. We find that the resulting errors show the 
expected inverse trend with the S/N ratio of the spectra, whereas the emcee errors 
show large variation at fixed S/N ratio. 

We tested the reliability of the errors by applying the same procedure to the 

individual LRIS and DEIMOS spectra for the six objects that were observed with 
both instruments (GC-39, GC-71, GC-73, GC-77, GC-85 and GC-92). For each 
object, the observed difference between the LRIS and DEIMOS velocities was 
divided by the expected error in the difference. The root mean square (r.m.s.) 
of these ratios is 1.2 + 0.3, that is, the empirically determined uncertainties are 
consistent with the observed differences between the independently measured 
LRIS and DEIMOS velocities. 
Velocity dispersion. The observed velocity distribution of the ten clusters is not 
well approximated by a Gaussian. Six of the ten have velocities that are within 
+4kms~! of the mean and one is 39 kms~! removed from the mean. As a result, 
different ways to estimate the Gaussian-equivalent velocity spread oop, yield 
different answers. The normalized median absolute deviation oopsnmad=4.7 kms}, 
the biweight®® oo45,bi=8.4kms_! and the r.m.s. Gobs,rms = 14.3kms~!. The r.m.s. 
is driven by one object with a relatively large velocity uncertainty (GC-98) and is 
inconsistent with the velocity distribution of the other nine. Specifically, for ten 
objects drawn from a Gaussian distribution and including the observed errors, 
the probability of measuring op; < 8.4 if Orms > 14.3 is 1.5%, and the probability 
of measuring Opmaa < 4-7 is 3 x 10-7. We therefore adopt the biweight dispersion 
rather than the r.m.s. when determining the intrinsic dispersion below. We then 
show that the presence of GC-98 is consistent with the intrinsic dispersion that 
we derive using this statistic. 

The observed dispersion must be corrected for observational errors, which are 
of the same order as oop; itself. We determined the intrinsic dispersion and 
its uncertainty in the following way. For a given value of ote, we generated 
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1,000 samples of ten velocities, distributed according to a Gaussian of width test. 
The ten velocities in each sample were then perturbed with errors, drawn from 
Gaussians with widths equal to the empirically determined uncertainties in the 
measured dispersions. Using the biweight estimator, ‘measured’ dispersions Oobs,test 
were calculated for all samples. If the value 8.4 is within the 16th to 84th percentiles 
of the distribution of obs,test then test is within the +10 uncertainty on Gintr. This 


method gives Gintr = 3.2733 kms~!. As the intrinsic dispersion is consistent with 
zero, a more meaningful number than the best-fit is the 90% confidence upper 
limit; we find ointy<10.5kms7!. 

We now return to the question whether GC-98 is consistent with the other nine 

objects. This cluster has a velocity offset of Av=—39*1} kms~!. For the upper 
limit on the intrinsic dispersion (cin; = 10.5 km s~') the object is a 2.40 outlier, 
and the probability of having a more than 2.40 outlier in a sample of ten is 15%. 
Interestingly, the combination of the biweight constraint of ointr < 10.5 and the 
existence of GC-98 implies a fairly narrow range of intrinsic dispersions that are 
consistent with the entire set of ten velocities (assuming that they are drawn from 
a Gaussian distribution and that the errors are correct). The probability of having 
at least one object with the velocity of GC-98 is <10% if the intrinsic dispersion is 
Cintr < 8.8 kms~ 1. Taking both 90% confidence limits at face value, the allowed 
range in the intrinsic dispersion is 8.8kms~! <ointr<10.5kms~ 1. 
Expected dispersion from Local Group galaxies. In Fig. 3a we illustrate how 
unusual the kinematics of NGC1052-DF2 are by comparing the observed velocity 
distribution to that expected from Local Group galaxies with the same stellar mass 
(broken red curve). The width of this Gaussian was calculated from the SEPT2015 
version of the Nearby Dwarf Galaxies catalogue’’. The catalogue has entries for 
both velocity dispersions and rotation velocities, and for both gas and stars. To 
obtain a homogeneous estimate we use ‘effective’ dispersions, oe = (07 +0.5Vr017)°°, 
where V;o¢ is the inclination-corrected rotation velocity. When both gas and stellar 
kinematics are available, we use the highest value of oe, as this typically is a rota- 
tion curve measurement from gas at large radii. Stellar masses were calculated 
directly from the V-band absolute magnitude assuming M/Ly = 2.0, for consistency 
with NGC1052-DF2. Five galaxies have a stellar mass that is within a factor of two 
of that of NGC 1052-DF2: these are IC 1613, NGC 6822, Sextans B and the M31 
satellites NGC 147 and NGC 185. The average dispersion of these five galaxies is 
(oes?) =32kms~!, with an rm.s. variation of 8kms"!. 

In Extended Data Fig. 2, we compare NGC 1052-DF? to the nearby dwarf 

sample in the plane of velocity dispersion versus half-light radius, with the size 
of the symbols indicating the stellar mass. Comparing NGC1052-DF? to other 
galaxies with velocity dispersions in this range, we find that its size is larger by a 
factor of about 10 and its stellar mass is larger by a factor of about 100. 
Distance. The heliocentric radial velocity of NGC 1052-DF2 is 1,803 +2kms"}, 
or 1,748 + 16kms7! after correcting for the effects of the Virgo cluster, the 
Great Attractor and the Shapley supercluster on the local velocity field“. For 
Hy=70+3kms~!Mpc~1, a Hubble flow distance of Dy: =25 + 1 Mpc is obtained. 
However, the proximity to NGC 1052 (14’, or about 80 kpc in projection) strongly 
suggests that NGC1052-DF? is associated with this massive elliptical galaxy. The 
distance to NGC 1052, as determined from surface brightness fluctuations (SBFs) 
and the fundamental plane™!®, is D952 = 20.4 + 1.0 Mpc. The velocity of NGC1052- 
DF2 with respect to NGC 1052 is then +293kms~1. 

A third distance estimate can be obtained from the luminosity function of the 
compact objects. As discussed elsewhere (P.v.D. et al., manuscript in preparation), 
the luminosity function has a narrow peak at my 22.0. The canonical globular 
cluster luminosity function can be approximated by a Gaussian with a well-defined 
peak* at My ~ —7.5. If the compact objects are typical globular clusters, the 
implied distance is Dac + 8 Mpc. The main conclusions of the paper would be 
weakened considerably if the galaxy were so close to us. For this distance, the 
stellar mass estimate is an order of magnitude lower: Mgtars ¥ 3 10’Mq. The four 
Local Group galaxies that have a stellar mass within a factor of two of this value 
(Fornax, Andromeda II, Andromeda VII and UGC 4879) have a mean dispersion 
of (gee) = 11.7+0.5kms "|, only slightly higher than the upper limit to the disper- 
sion of NGC1052-DF2. The peculiar velocity of the galaxy would be about 
1,200kms~!; this is, of course, extreme, but it is difficult to argue that it is less likely 
than having a highly peculiar globular cluster population and a lack of dark matter. 

Fortunately, we have independent information to verify the distance, namely 
the appearance of NGC1052-DF? in the HST images. In Extended Data Fig. 3a, 
we show the central 33” x 33” of the galaxy in the HST [14 band. A smooth model 
of the galaxy, obtained by median filtering the image, was subtracted; background 
galaxies and globular clusters were masked. The mottled appearance is not due to 
noise but due to the variation in the number of giants contributing to each pixel. 
Following previous studies*“*4”, we measure the SBF signal from this image and 
determine the distance from the SBF magnitude. 

The azimuthally averaged power spectrum of the image is shown in Extended 
Data Fig. 3b. As is customary*®, the smallest and largest wavenumbers are omitted, 


as they are dominated by residual large-scale structure in the image and by noise 
correlations, respectively. Again following previous studies”, the power spectrum 
is fitted by a combination of a constant (dotted line) and the expectation power 
spectrum E(k) (dashed line). The expectation power spectrum is the convolution of 
the power spectrum of the point spread function and that of the window function. 
The window function is the square root of the median-filtered model of the galaxy, 
multiplied by the mask containing the globular clusters and background galaxies. 

The normalization of E(k) is the SBF magnitude, mg,4. We find mgi4= 
29.45 + 0.10. Using equation (2) in ref. 47, Veos — Is14= 0.37 £0.05, and gu75 — Igia= 
1.852(Veo6 — Ig14) + 0.096, the absolute SBF magnitude is Mgi4= —1.94+ 0.17. 
The uncertainty is a combination of the error in the Veo¢ — Igi4 colour and the 
systematic uncertainty in the extrapolation of the relation between gy75 — Igi4 col- 
our and Mgy4 (as determined from the difference between equations (1) and (2) 
in ref. 47). The SBF distance modulus m — M=31.39 £0.20, and the SBF distance 
Dspr = 19.0 + 1.7 Mpc. This result is consistent with Dj952 and rules out the 
‘globular cluster distance’ of Dgc = 8 Mpc. 

Stellar mass. We determined the stellar mass from a stellar population synthesis 
model*!. A two-dimensional model galaxy was created using the ArtPop code” 
that matches the morphology, luminosity, colour and SBF signal of NGC1052-DF2. 
The model has a metallicity [Z/H] = —1 and an age of 11 Gyr. These parameters 
are consistent with the regular morphology of the galaxy and with spectroscopic 
constraints on the stellar populations of Coma cluster UDGs”°. For a Kroupa”! 
initial mass function, the stellar mass of this model is 1.8 x 108M, similar to that 
obtained from a simple M/Ly = 2.0 conversion! (Mstars=2.2 x 108M). In the main 
text and below, we assume Metars¥ 2 X 10°M3. We note that the uncertainty in the 
stellar mass is much smaller than that in the dynamical mass, for reasonable choices 
of the initial mass function. 

Dynamical equilibrium. Some large low-surface-brightness objects are almost 
certainly in the process of disruption; examples are the ‘star pile’ in the galaxy 
cluster Abell 545°, the boomerang-shaped galaxy DF4 in the field of M101™, 
and Andromeda XIX in the Local Group* (marked in Extended Data Fig. 2). 
Andromeda XIX, and also And XXI and And XXV, are particularly informative 
as they combine large sizes with low velocity dispersions®, and it has been sug- 
gested that tidal interactions have contributed to their unusual properties*. (These 
galaxies are not direct analogues of NGC1052-DF2: the stellar masses of these 
Andromeda satellites are a factor of about 100 lower than that of NGC1052-DF2, 
and their dynamical M/L ratios are at least a factor of 10 higher.) In Extended 
Data Fig. 4c, we show the Gemini i-band image of NGC1052-DF? (along with the 
Dragonfly and SDSS images). There is no convincing evidence of strong position 
angle twists or tidal features at least out to R¥2R, (see also the Dragonfly image 
in Extended Data Fig. 4a). The regular appearance strongly suggests that the object 
has survived in its present form for multiple dynamical times, and we infer that 
the kinematics can probably be interpreted in the context of a system that is in 
dynamical equilibrium. 

We note that the regular morphology of NGC1052-DF2 also provides an inter- 
esting constraint on its formation time: the orbital velocity in the outer parts is 
5 Gyr, which means it must have formed very early in order to lose any sign of 
its assembly. Furthermore, it provides a lower limit for the 3D distance between 
NGC1052-DF2 and the massive elliptical galaxy NGC 1052. The Jacobi radius 
(that is, the distance from the centre of the galaxy to the first Lagrangian point) 
is given by” 
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with Rj952 the distance between NGC1052-DF2 and NGC 1052, and Vj9s52 the 
circular velocity of NGC 1052. Taking Rj > 5kpc, M2 x 108M. and 
Vi052300kms_! (the velocity difference between the two galaxies, as well as 
approximately /2 71952)» We obtain Rjo52 greater than about 160 kpc, a factor of two 
larger than the projected distance. 

Source of dynamical support. The morphology of the galaxy strongly indicates 
that it is supported by random motions rather than rotation: the Sérsic index is 0.6, 
similar to that of dSph galaxies; the isophotes are elliptical rather than disk-like; 
and there are no bars, spiral arms, or other features that might be expected in a thin 
disk. The galaxy has not been detected in moderately deep H 1 observations”. It 
is also difficult to imagine a physical model for the formation of a huge, extremely 
thin disk of massive blue globular clusters, even in spiral galaxies: although the 
kinematics of the metal-rich globular cluster population in M31 are clearly related 
to its disk, the metal-poor ones have a large velocity dispersion®. Finally, there is no 
evidence for a velocity gradient. In Extended Data Fig. 5a, we show the measured 
velocities of the globular clusters as a function of the projected distance along the 
major axis. There is no coherent pattern. Based on these arguments, our default 
mass measurement assumes that the galaxy is supported by random motions. 
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Dynamical mass measurement. Following a previous study of the kinematics of 
globular clusters in a UDG", we use the TME to determine the dynamical mass. 
This method was developed to determine the enclosed mass from an ensemble of 
discrete tracers, such as satellite galaxies or globular clusters!*“*. The mass within 
the distance of the outermost object is given by 


Mme = : mae eee (2) 
G 

with v the velocities of individual clusters with respect to the mean, r the projected 

distances of the clusters from the centre of the galaxy, rox the distance of the 

furthest cluster, and a the slope of the potential (with density px r-© +”)). For 

the case a = 1, the potential is similar to that of a point mass; a =0 corresponds to 

pxr anda flat rotation curve; and for a=—1 the density po r~!. Equation (2) 


does not take observational errors or outliers into account; we therefore introduce 
the modified expression 
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Here observational errors are taken into account by setting Vintr- =f '(Avobs), with 
A\obs listed in Fig. 2 and f ~! = oints/obs- Spi(x) denotes the biweight estimator of 
the width of the distribution. Note that equation (3) reduces to Mrmg = 
(C/ G)o? p:Fout» for a=0. The constant C is given by 
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with 7 the power-law slope of the density profile of the clusters, 3=1—0;’/a," the 
Binney anisotropy parameter and /(x) the gamma function. We determine the 
3D density profile from a power-law fit to the distribution of the spectroscopically 
confirmed globular clusters, finding y=0.9 + 0.3. 

For an isothermal velocity dispersion profile (a =0) and isotropic orbits (G=0) 

we determine Mr < 3.4 x 10°M. at 90% confidence. The results are not very 
sensitive to the assumed slope of the potential or moderate anisotropy. Changing 
a to 1 or —1 reduces the mass by 10% or 20% respectively. Tangential anisotropy 
with o,? =20,” increases the mass limit to Mrz < 4.2 x 10°Mo; radial anisotropy 
with o,7=0.50," yields Mrmr < 2.4 x 108M. We also consider errors in the density 
profile of the globular clusters; for y= 0.5 the mass decreases by 20% and for y= 1.5 
the mass increases by 30%. 
Robustness tests. As a test of the robustness of our results, we consider three 
alternative mass estimates. The first is the dynamical mass within the half-number 
radius of the globular cluster system!”. This mass estimate does not extend as far in 
radius as the TME method but is less sensitive to the assumed level of anisotropy. 
For Rye= 3.1 kpc and oint: < 10.5kms~! we find Mayn < 3.2 x 10°Mz (see Fig. 4). 
As the halo profile is still rising at R= 3.1 kpc, the constraint on the halo mass is 
weaker than our default value, and we find Mhalo <8 x 108M. 

The second test replaces op; with o+ms, even though the r.m.s. is driven by a 
single cluster (GC-98) and the velocity distribution of the other nine objects is 
inconsistent with this. The observed r.m.s. is Oobs,rms = 14.3kms™}, or Gintyrms = 
12.2kms ' after taking observational errors into account. The implied TME mass 
is Mgyn5 x 108Mo, and the halo mass Mpalo¥3 X 10°Mo. 

The third test sets the arguments against a disk aside and assumes that the 
observed velocities reflect rotation in an inclined, infinitely thin disk. The axis ratio 
of NGC1052-DF2 is b/a=0.85 + 0.02, which means that the inclination-corrected 
velocities are a factor of (sin(cos-! (b/a)))~!~ 1.9 higher than the observed ones. 
Assuming an (unphysical) disk dispersion of 0kms~', the inclination-corrected 
rotation velocity would be vyo¢ 1.4 x 3.2 x 1.9= 9t}4km s_!, where it is assumed 
that the rotation velocity is approximately 1.4 times the line-of-sight velocity 
dispersion®”*!. The enclosed mass within R=7.6 kpc would be Maisk= 
1.4°78 x 108M. 

For all these mass estimates, the implied ratio Mpato/Metars is less than 4. This is 

the lowest ratio measured for any galaxy and two orders of magnitude below the 
canonical relation between stellar mass and halo mass. 
Data availability. The HST data are available in the Mikulski Archive for Space 
Telescopes (MAST; http://archive.stsci.edu), under programme ID 14644. All other 
data that support the findings of this study are available from the corresponding 
author upon reasonable request. 
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Code availability. We have made use of standard data reduction tools in the IRAF 
and Python environments, and the publicly available codes SExtractor*®, GALFIT!! 
and emcee", 
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Extended Data Figure 1 | NGC1052-DF? in the Dragonfly field. The full Dragonfly field, approximately 11 degree”, centred on NGC 1052. The 
zoom-in shows the immediate surroundings of NGC 1052, with NGC1052-DF2 highlighted in the inset. 
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Extended Data Figure 2 | Comparison to Local Group galaxies. Open with a similar velocity dispersion are a factor of about 10 smaller and have 
symbols are galaxies from the Nearby Dwarf Galaxies catalogue’? and the stellar masses that are a factor of about 100 larger. The object labelled And 
solid symbol with error bars is NGC1052—DF2. The size of each symbol XIX is an Andromeda satellite that is thought to be in the process of tidal 
indicates the logarithm of the stellar mass, as shown in the key. There are disruption®>. 


no galaxies in the Local Group that are similar to NGC1052-DF2. Galaxies 
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Extended Data Figure 3 | Analysis of surface brightness fluctuation. 
We use the surface brightness fluctuation (SBF) signal in the HST Igi4 
band to constrain the distance to NGC1052-DF2. a, The galaxy after 
subtracting a smooth model and masking background galaxies and 
globular clusters. The image spans 33” x 33”. b, The azimuthally averaged 
power spectrum. Following previous studies?4°”, the power spectrum 


SBF P(k) 


50 100 150 200 250 
wave number k 


is fitted by a combination of a constant (dotted line) and an expectation 
power spectrum E(k) (dashed line). From the normalization of E(k) we 
find that the SBF magnitude mg 4 = 29.45 £0.10. The implied distance is 
Dspr = 19.0 + 1.7 Mpc, consistent with the 20 Mpc distance of the luminous 
elliptical galaxy NGC 1052. 
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Extended Data Figure 4 | Morphological coherence. a, The sum of 
gand r images taken with the Dragonfly Telephoto Array. The image 

was smoothed by a 10” x 10” median filter to bring out faint emission. 

The lowest surface brightness levels visible in the image are about 

29 magarcsec *. b, Sum of SDSS g, r and i images. In SDSS, the overdensity 
of compact objects stands out. c, The Gemini-North i-band image of 
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NGC1052-DF2, which provides the best information on the morphology 
of the galaxy. Black ellipses mark R= R, and R=2R,. White arrows mark 
the most obvious reduction artefacts. The galaxy is regular out to at least 
R 2R,, with a well-defined centre and a position angle and axis ratio that 


do not vary strongly with radius. 
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Extended Data Figure 5 | Are the globular clusters in a thin rotating disk? a, b, Globular cluster velocities as a function of projected position along the 
major axis (a) and the minor axis (b). There is no evidence for any trends. For reference, a Gaussian with o=8.4km s_!is shown in b. 
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A programmable two-qubit quantum processor 


in silicon 


T. FE Watson|, S. G. J. Philips!, E. Kawakami!, D. R. Ward’, P. Scarlino!, M. Veldhorst!, D. E. Savage’, M. G. Lagally?, Mark Friesen?, 


S. N. Coppersmith’, M. A. Eriksson? & L. M. K. Vandersypen! 


Now that it is possible to achieve measurement and control 
fidelities for individual quantum bits (qubits) above the threshold 
for fault tolerance, attention is moving towards the difficult task 
of scaling up the number of physical qubits to the large numbers 
that are needed for fault-tolerant quantum computing’”. In this 
context, quantum-dot-based spin qubits could have substantial 
advantages over other types of qubit owing to their potential for 
all-electrical operation and ability to be integrated at high density 
onto an industrial platform**. Initialization, readout and single- 
and two-qubit gates have been demonstrated in various quantum- 
dot-based qubit representations® °. However, as seen with small- 
scale demonstrations of quantum computers using other types of 
qubit'*!°, combining these elements leads to challenges related to 
qubit crosstalk, state leakage, calibration and control hardware. Here 
we overcome these challenges by using carefully designed control 
techniques to demonstrate a programmable two-qubit quantum 
processor in a silicon device that can perform the Deutsch-Josza 
algorithm and the Grover search algorithm—canonical examples of 
quantum algorithms that outperform their classical analogues. We 
characterize the entanglement in our processor by using quantum- 
state tomography of Bell states, measuring state fidelities of 85-89 
per cent and concurrences of 73-82 per cent. These results pave the 
way for larger-scale quantum computers that use spins confined to 
quantum dots. 

Solid-state approaches to quantum computing are challenging to 
realize owing to unwanted interactions between the qubit and the host 
material. For quantum-dot-based qubits, charge and nuclear-spin 
noise are the dominant sources of decoherence and gate errors. Some 
of these effects can be cancelled out by using dynamical decoupling’ or 
decoherence-free subspaces”">, but there has also been substantial 
progress in reducing these noise sources by growing better oxides and 
heterostructures!® and moving to silicon, owing to its naturally low 
abundance of nuclear-spin isotopes that can be removed through iso- 
topic purification'’. These material developments have greatly extended 
the coherence times of qubits, enabling single-qubit gate fidelities 
of above 99%!8-?!, and recently resulted in the demonstration of a 
controlled-phase (CZ) gate between two single-electron-spin qubits in 
a silicon metal-oxide-semiconductor (Si- MOS) device®. Here, we show 
that with two single-electron-spin qubits in a natural silicon/silicon 
germanium (Si/SiGe) double quantum dot we can combine initialization, 
readout, single- and two-qubit gates to form a programmable quantum 
processor in silicon that can perform simple quantum algorithms. 

A schematic of the two-qubit quantum processor is shown in Fig. la. 
The device is similar to that described previously” except for an addi- 
tional micrometre-scale magnet (“micromagnet’). A two-dimensional 
electron gas is formed in the natural silicon quantum well of a SiGe 
heterostructure using two accumulation gates. The double quantum 
dot is defined in this two-dimensional electron gas by applying negative 
voltages to the depletion gates, with the estimated positions of the first 
(D1) and second (D2) quantum dot shown by the purple and orange 
circles, respectively. The two qubits, Q1 and Q2, are defined by applying 


a finite magnetic field B.,,=617 mT and using the Zeeman-split spin- 
down |0) and spin-up |1) states of single electrons, respectively, which 
are confined in D1 and D2. The initialization and readout of Q2 are 
performed by spin-selective tunnelling to a reservoir”’, while Q1 is 
initialized at a spin-relaxation hotspot”! and measured via Q2 using a 
controlled-rotation (CROT) gate. The complete measurement sequence 
and set-up are described in Extended Data Figs 1 and 2. We achieve 
initialization (‘T subscript) and readout (‘m’ subscript) fidelities for 
the two qubits of Fy, > 99%, Fi. > 99%, Fini = 73% and Fn2 = 81% (see 
Methods). 

The coherent individual control of both qubits is achieved by pat- 
terning three cobalt micromagnets on top of the device (Fig. 1a). 
These micromagnets provide a magnetic-field gradient with a com- 
ponent that is perpendicular to the external magnetic field for electric 
dipole spin resonance (EDSR)*°. Furthermore, the field gradient 
across the two dots results in qubit frequencies that are well separated 
(for = 18.4 GHz, fa2 = 19.7 GHz), which allows the qubits to be 
addressed independently. For both qubits, we achieve Rabi frequen- 
cies of fg = wWr/(21) = 2 MHz and perform single-qubit X and Y 
gates by using vector modulation of the microwave drive signals. Here, 
we define an X (Y) gate to bea 1/2 rotation around x (y), and hence- 
forth define a 7 rotation to be X? (Y”). We measure the spin relaxation 
time (T)), free evolution time (T3) and the Hahn echo decay time 
(Totahn) Of Q1 in the (1, 1) regime (where (m, n) denotes a configu- 
ration with m electrons in D1 and n electrons in D2) to be T; > 50 ms, 
T3=1.0+40.1ps and TrHahn= 19 +3 ps, and those of Q2 to be 
T) > 3.7 £0.5 ms, T3=0.6£0.1 1s and ToHahn = 7 + 1 ps (Extended 
Data Fig. 3). Using randomized benchmarking” we find average 
single-qubit gate fidelities of 98.8% for Q1 and 98.0% for Q2 
(Extended Data Fig. 4)—close to the fault-tolerance error threshold 
for surface codes”’. 

Universal quantum computing requires the implementation of 
single- and two-qubit gates. In our quantum processor we implement 
a two-qubit CZ gate*”®. This gate can be understood by considering 
the energy-level diagram for two electron spins in a double quantum 
dot (Fig. 1b), in the regime in which the Zeeman-energy difference 
is comparable to the inter-dot tunnel coupling (AEz ~ f,). In Fig. 1b 
we plot the energies of the two-spin states (|00), |01), |10) and |11)) 
in the (1, 1) charge regime and the singlet ground state S(0, 2) in the 
(0, 2) charge regime as a function of the detuning ¢. Here, detuning 
describes the energy difference between the (1, 1) and (0, 2) charge 
states of the double quantum dot, controlled by the voltage applied to 
plunger gate P1 (Extended Data Fig. 2). The anticrossing between the 
S(0, 2) and the antiparallel |01) and |10) states causes the energy of the 
antiparallel states to decrease by J(e)/2 as the detuning is decreased 
(Fig. 1b), where J(¢) is the exchange coupling between the two elec- 
tron spins. 

The energy structure of the two-electron system can be probed 
by performing microwave spectroscopy as a function of detuning 
(Fig. 1c). At negative detuning, the resonance frequency (Zeeman 
energy) increases linearly (dashed line) owing to the electron 
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Figure 1 | Two-qubit quantum processor in silicon. a, Schematic of a 
Si/SiGe double-quantum-dot device, showing the estimated positions of 
quantum dots D1 (purple circle) and D2 (orange circle) that are used to 
confine two electron-spin qubits Q1 and Q2, respectively. Both quantum 
dots were formed from the two-dimensional electron gas (2DEG) in the 
silicon quantum well (Si QW) on the right side of the device to achieve an 
inter-dot tunnel coupling that is suitable for two-qubit gates. The positions 
of the dots were realized by tuning the numerous accumulation and 
depletion gates, but were probably helped by disorder in the Si/SiGe 
heterostructure. The ellipse shows the position of the quantum-dot sensor 
used for spin readout. Microwave signals MW1 and MW2are used to 
perform electric dipole spin resonance (EDSR), mediated by the cobalt 
(Co) micromagnets, on Q1 and Q2, respectively, while voltage pulses are 
applied to plunger gates P1 and P2 for qubit manipulation and readout. 

b, Energy-level diagram of two electron spins in a double quantum dot as a 
function of the detuning energy ¢ between the (1, 1) and (0, 2) charge 
states. Towards zero detuning the energy levels of the anti-parallel spin 
states shift by half the exchange energy J (see inset) c, Microwave 
spectroscopy of Q2 showing the spin-up probability P|,) versus the MW2 
frequency and detuning energy after initialization of Q1 to (|0) + |1))/J2. 
The detuning voltage was converted to energy using a lever arm of 

a= 0.09e, where e is the electron charge (Extended Data Fig. 5). The map 


wavefunction moving in the magnetic-field gradient. At more positive 
detuning, closer to the (0, 2) regime, the exchange energy is substantial 
compared to the line width of the resonance J/h > wr (where h = 2th is 
the Planck constant), resulting in two clear resonances. Applying a 7 
pulse at one of these frequencies results in a CROT gate, which is used 
to perform the projective measurement of Q1 via the readout of Q2 
(Extended Data Fig. 6). 

The CZ gate is implemented by applying a detuning pulse for a fixed 
amount of time ¢, which shifts the energy of the antiparallel states. 
Throughout the pulse, we stay in the regime in which J(¢) < AEz, so 
the energy eigenstates of the system are still the two-spin product states 
and the two-qubit interaction can be approximated by an Ising 
Hamiltonian, leading to the following unitary operation: 


1 0 0 0 
0 el(e)t/(2h) 0 0 
t)=Z,(0,)Z2(0 
Ucz(t) = Z1(01)Z2(2) _ ‘ pilon/On 9 (1) 
0 0 0 1 
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shows that Q2 has two different resonant frequencies (blue arrows in b) 
depending on the spin state of Q1, which are separated by the exchange 
energy J. d, The spin-up probability of Q2 after applying the Ramsey 
sequence (see inset) in which the duration of the detuning pulse is varied 
between two X gates on Q2, and the control Q1 is initialized to spin-down 
(blue curve) or spin-up (red curve). e, The spin-up probability of the target 
qubit (T; Q1) after applying the Ramsey sequence shown above the plot, in 
which a CZ gate is applied between two 1/2 pulses and the phase of the 
second 7/2 pulse is varied. Here, the control (C) qubit (C; Q2) is initialized 
to spin-down (blue curve) or spin-up (red curve) and the spin-up 
probability has been normalized to remove initialization and readout 
errors. The exchange energy during the CZ gate is J/h = 10 MHz. f, Similar 
to e, but with Q2 as the target qubit and Q1 as the control qubit. In e and 

f, the black dashed lines show the Z rotations on Q1 and Q2? that are 
needed to form the CZ; gates. g, h, Similar to e and f, but using a 
decoupled version of the CZ gate (DCZ gate), which removes the 
unconditional Z rotations due to the detuning dependence of Ez(¢). 
Consequently, the Z rotations that are required to form the CZ; gates 
(dashed black lines) are always at 90° and 270°, which simplifies the 
calibration. All error bars are 1c from the mean, calculated using a Monte 
Carlo method (see Methods). 


where the basis states are |00), |01), |10) and |11), and Z,(0,) and Z2(02) 
are rotations around Z caused by the change in the Zeeman energy of 
the qubits due to the magnetic-field gradient. The CZ gate is advanta- 
geous over the CROT gate because it is faster and less time is spent at 
low detuning, at which the qubits are more sensitive to charge noise. In 
addition, we observed that performing the CROT gate with EDSR can 
lead to state leakage into the S(0, 2) state, seen in Fig. 1c by the increase 
in background dark counts near ¢ =0. The CZ gate is demonstrated 
in Fig. 1d; we vary the duration of a CZ voltage pulse between two X 
gates on Q2 in a Ramsey experiment, showing that the frequency 
of the Z rotation on Q2 is conditional on the spin state of Ql. The 
processor's primitive two-qubit gates, CZj|m, n)= (-1)°6 5G in, n) 
for i, j, m, n € {0, 1}, are constructed by applying the CZ gate for 
a time t= h/J followed by Z rotations on QI and Q2, CZj= 
Z\[(-1)'n/2 — @,]Z.[(—1)'n/2 — 02] Ucz(th/J). Rather than physically 
performing the Z rotations, we change the reference frame in the 
software by incorporating the rotation angles 9; and 0 into the phase 
of any subsequent microwave pulses!”. 
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Combining single- and two-qubit gates together with initialization 
and readout, we demonstrate a programmable processor—whereby 
we can program arbitrary sequences for the two-qubit chip to 
execute within the coherence times of the qubits. To achieve this, 
several challenges needed to be overcome. The device had to be 
tuned further so that during single-qubit gates the exchange coupling 
was low (Jog/h = 0.27 MHz; Extended Data Fig. 7) compared to our 
single-qubit (about 2 MHz) and two-qubit (about 6-10 MHz) gate 
times. Tuning was also required to raise the energy of low-lying 
valley-excited states to prevent them from being populated during 
initialization’’. Furthermore, we observed that applying microwave 
pulses on Q] shifts the resonance frequency of Q2 by around 2 MHz. 
We rule out the AC Stark shift and effects from coupling between the 
spins and from heating as possible explanations, but find that the prop- 
erties of the quantum dots affect the frequency shift (Supplementary 
Information, section $1). Although the origin of the shift is unknown, 
we keep the resonance frequency of Q2 fixed during single-qubit gates 
by applying an off-resonant pulse (30 MHz) to Q1 if Q1 is idle. 

Before running sequences on the quantum processor, all gates need 
to be properly calibrated. The single-qubit X and Y gates were calibrated 
using both a Ramsey sequence and the AIIXY calibration sequence to 
determine the qubit resonance frequency and the power needed to 
perform a 7/2 gate (Supplementary Information, section S2). To cali- 
brate the CZ; gates we performed the Ramsey sequence shown in 
Fig. le and varied the phase of the second 1/2 gate. In Fig. le we show 
the results of this measurement, for which Q1 is the target qubit and 
the control qubit Q2 is prepared in either the |0) (blue curve) or |1) (red 
curve) state. The duration of the CZ gate is calibrated so that the blue 
and red curves are 180° out of phase. These measurements also deter- 
mine the Z rotation on Q1 that is needed to form CZij, which corres- 
ponds to the phase of the second 1/2 gate, which either maximizes or 
minimizes the spin-up probability for Q2 (dashed lines in Fig. le). 
The Z rotation needed for Q2 is calibrated by performing a similar 
measurement, but with the roles of Q1 and Q2 switched (Fig. 1f). 

The Z rotations in equation (1) can be eliminated by using a 
decoupled CZ gate, DCZ = Ucz(wh/(2J))X{X 3Ucz(wh/(2J)) , which 
incorporates refocusing pulses and can be used to perform 
DCZ j= X{X3CZ y= Zl (—1)/x/2]Z2[(—1)'x/2]DCZ. This operation 
is demonstrated in the Ramsey experiment shown in Fig. 1g, h, in 
which the minimum and maximum spin-up probabilities occur at a 
phase of either 90° or 270°. In addition to removing the need to cali- 
brate the required Z rotations, the DCZ gate is advantageous because 
it cancels out the effect of the low-frequency noise that couples to the 
spins via oz ® Iand I ® oz terms during the implementation of the 
gate, where J is the 2 x 2 identity matrix and az is the Pauli z 
matrix. 

After proper calibration, we characterize entanglement in our quan- 
tum processor by preparing Bell states and reconstructing the two-qubit 
density matrix using quantum-state tomography. The quantum circuit 
for the experiment is shown in Fig. 2a. The Bell states are prepared 
using a combination of single-qubit gates and the decoupled two-qubit 
DCZ; gates. The density matrix is reconstructed by measuring two-spin 
probabilities for the nine combinations of three different measurement 
bases (x, y, z) with 10,000 repetitions (Methods). In our readout scheme 
the states are projected into the z basis, while measurements in the other 
bases are achieved by performing X and Y pre-rotations. Owing to the 
time needed to perform these measurements (about 2h), the frequency 
of the qubits was calibrated after every 100 repetitions. The real com- 
ponents of the reconstructed density matrices of the four Bell states 
((|00) +|11)) /-/2 (01) +|10))-/2 ) are shown in Fig. 2b-e. The state 
fidelities F = (~)|p|1) between the measured density matrix (p) and the 
target Bell state (7) range from 85% to 89% and the concurrences from 
73% to 82%, demonstrating entanglement. A parallel experiment 
reported a Bell-state fidelity of 78%’. 

To test the programmability of the two-qubit quantum processor we 
perform the Deutsch-Josza*’ and Grover search*! quantum algorithms. 
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Figure 2 | Preparation of the Bell states and two-qubit entanglement 
in silicon. a, The quantum circuit used to prepare the Bell states 

and to perform quantum state tomography. The Bell states are prepared 
using a combination of single-qubit Y gates and two-qubit DCZ; gates 
while quantum-state tomography is performed by using different 
combinations of I, X and Y pre-rotation gates, where I corresponds 

to the identity. b-e, The real component of the reconstructed density 
matrices (Re(p)), determined using a maximum-likelihood estimate for 
the four Bell states W+ = (01) + |10))/./2 (b), W~ = (|01) — |10))//2 (c), 
* = (00) + |11))/J2 (d), & = (|00) — |11))//2 (e). The imaginary 
components of the elements of the density matrices are less than 0.08 

in all cases (Supplementary Information, section $3). We measure state 
fidelities of Fy+=0.88+0.02, Fy- = 0.88+0.02, Fg+=0.85+0.02 and 
Fo = 0.89 + 0.02, and concurrences of c+ = 0.80 + 0.03, cy- = 0.82 £0.03, 
Cgt = 0.73 £0.03 and co = 0.79 + 0.03; all errors are 1a from the mean. 


se 


The Deutsch-Josza algorithm determines whether a function is con- 
stant (f,(0) =f,(1) =0 or (0) =f2(1) = 1) or balanced (f3(0) =0, 
(1) = 1 or f(0) = 1, fa(1) =0). These four functions are mapped onto 
the unitary operators Uy =1, Us, =X}, Up, = CNOT = YCZ1¥) and 
Us, = Z-CNOT = Y5CZ oY, where the overbar denotes a negative rota- 
tion. For both the controlled-NOT (CNOT) and the zero- 
controlled-NOT (Z-CNOT) gates, the target qubit is Q2. At the end of 
the sequence the input qubit (Q1) will be in the |0) or |1) state for the 
constant and balanced functions, respectively. Grover’s search algo- 
rithm provides an optimal method for finding the unique input value 
Xq of a function f(x) that gives f(xo) = 1 and f(x) =0 for all other values 
of x. In the two-qubit version of this algorithm there are four input 
values, x € {00, 01, 10, 11}, resulting in four possible functions fil), 
with i, j € {0, 1}. These functions are mapped onto the unitary opera- 
tors, CZ |x) = (—1)4i“|x), which mark the input state with a negative 
phase if f(x) = 1. The algorithm finds the state that has been marked 
and outputs it at the end of the sequence. 

In Fig. 3 we show the measured two-spin probabilities as a function 
of time during the algorithms for each function. The experimental 
results (circles) are in good agreement with the simulated ideal cases 
(dashed lines). Although a number of repetitions are needed as a result 
of gate and readout errors, the algorithms are successful at determining 
the balanced and constant functions and finding the marked state in the 
oracle functions. The data shown as square symbols are taken shortly 
after calibration and are in line with the experimental results (circles), 
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Figure 3 | Two-qubit quantum algorithms in silicon. a, b, Quantum 
circuits showing the sequences of single- and two-qubit gates that were 
applied for the Deutsch—Josza algorithm (a) and the Grover search 
algorithm (b) for two qubits. The Deutsch-Josza algorithm determines 
whether the function Uyis constant or balanced; the Grover search 
algorithm finds the state marked by the function U, ¢, d, Two-spin 
probabilities as a function of time throughout the sequence during the 
Deutsch-Josza algorithm (c) and the Grover search algorithm (d), for each 
of four possible functions. Each experimental data point (circles) 
corresponds to 4,000 repetitions and has been normalized to remove 
readout errors. The dashed lines are the simulated ideal cases and the solid 


indicating that calibrations remain stable throughout the hour of data 
collection. The diamonds show the outcome of the algorithms using 
the DCZ gate. In most cases, the diamonds also show similar values to 
the circles, which means that the DCZ gate does not improve the final 
result. This suggests that low-frequency single-qubit noise during the 
CZ gate is not dominant. The substantial difference between Hahn- 
echo and Ramsey decay times still points to substantial low-frequency 
noise. Single-qubit low-frequency noise, whether from nuclear-spin 
or charge noise, reduces single-qubit coherence, in particular during 
wait and idle times in the algorithms. In addition, charge noise affects 
the coupling strength J during the implementation of the CZ gates. 
Numerical simulations (solid lines in Fig. 3c, d and Extended Data 
Fig. 10) show that quasi-static nuclear-spin and charge noise can repro- 
duce most of the features seen in the data for the two-qubit algorithm 
(Methods). Smaller error contributions include residual coupling dur- 
ing single-qubit operations and miscalibrations. 

Substantial improvements could be made in the performance of the 
processor by using isotopically purified ?8Si (refs 18, 19, 21), which 
would increase the coherence times of the qubits. Furthermore, recent 
experiments have shown that symmetrically operating an exchange 
gate by pulsing the tunnel coupling rather than the detuning leads 
to a gate that is less sensitive to charge noise, greatly improving 
fidelities>? >>. With these improvements, and combined with more 
reproducible and scalable device structures, quantum computers 
with multiple qubits and fidelities above the fault-tolerance threshold 
should be realizable. 
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lines are the simulated results when decoherence is introduced by 
including quasi-static nuclear-spin and charge noise (o- = 11 eV). For 
both algorithms, the square data points show the final results of the 
algorithms when all four functions are evaluated in the same measurement 
run with identical calibration. The diamonds show the result of both 
algorithms when using the DCZ gate, demonstrating similar performance 
to the CZ gate. For the Deutsch—Josza algorithm, the identity J is 
implemented as either a 200-ns wait (circles and squares) or as I= XX} 
(diamonds). All error bars are 1o from the mean. The red and blue shading 
in cand d corresponds to that in a and b. 
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METHODS 


Estimate of initialization and readout errors for Q1 and Q2. The initialization 
and readout procedures for Q1 and Q2 are described in Extended Data Fig. 2. 
The initialization and readout fidelities of Q2 were extracted by performing the 
following three experiments and measuring the resulting spin-up probabilities 
(P, P2 and P3): (i) initialize Q2 and wait for 77}; (ii) initialize Q2; and (iii) initia- 
lize and perform a x rotation on Q2. These three spin-up probabilities are related 
to the initialization fidelity (72) and the spin-up and spin-down readout fidelities 
(F\o),2 and F),),2) by 


P,;=1—F),2 

P2= Fiy,2(1 — 72) + (1 — Fio),2) Yo (2) 
P. 

<3 = Fy + (1 — Fio),2)(1 — 9) 

Py 


where P, is the expected probability to be in the spin-up state after the application 
of the 7 pulse for Q2, which is determined as described below. In equation (2) 
we assume that a waiting time of 7T| leads to 100% initialization and that 
the measured spin-up counts are due to the readout infidelity. By solving these 
equations we can extract the initialization and readout fidelities. For Q1, we per- 
formed initialization by pulsing to a spin-relaxation hotspot (Extended Data Fig. 5) 
for 500T| and we therefore assume that the initialization fidelity is about 100%. 
Consequently, the readout fidelities of Q1 were extracted by performing only 
experiments (ii) and (iii) above. The readout and initialization fidelities for Q1 
(Q2) during the state tomography experiments were estimated to be 7, > 99% 
(q2 > 99%), Fo) =92% (Fio),2 = 86%) and Fuya =54% (Fiy2 = 76%), for which we 
used P,1 = 98% (Px2=97%) on the basis of simulations that include the dephasing 
time of the qubits (see below). The average measurement fidelity Fn = (Fjo) + Fj1))/2 
for Q1 (Q2) is 73% (81%). These fidelities are limited mostly by the finite electron 
temperature T. + 130 mK and the fast spin-relaxation time of Q2 (T; =3.7 ms), 
which is probably caused by a spin-relaxation hotspot that is due to a similar valley 
splitting and Zeeman energy”®. 

Removing readout errors from the measured two-spin probabilities. In the 
experiment the measured two-spin probabilities PM = (Pito) Pin Piiby Py 
include errors due to the limited readout fidelities Fj); and F\1) ; of spin-down (0) 
and spin-up (1) electrons for qubit i. To remove these readout errors and obtain 
the actual two-spin probabilities P= (Pjoo), Pio1), Pi10), Pay) we use the relation- 
ship PM = (F, ® F2)P, where 


Foo),i 
1— Fo), 


1— Fi),i 


Fi), (3) 


State tomography. The density matrix of a two-qubit state can be expressed as 
p= jo, Mz where Mare 16 linearly independent measurement operators. The 
coefficients c; are calculated from the expectation values m; of the measurement 
operators using a maximum-likelihood estimate!!”. The expectation values were 
calculated by performing 16 combinations of (I, X, Y, X?) pre-rotations on QI and 
Q2 and measuring the two-spin probabilities over 10,000 repetitions per measure- 
ment. The two-spin probabilities were converted to actual two-spin probabilities 
by removing the readout errors using equation (3). To calculate the density matri- 
ces shown in Fig. 2 we used only the data from the (J, X, Y) pre-rotations on the 
assumption that J will give a more accurate estimate of the expectation values than 
X? owing to gate infidelities. If we include X? then we achieve state fidelities of 
80%-84% and concurrences of 67%-71% (Supplementary Information, section 
$3). In the analysis we assume that the pre-rotations are perfect, which is a reason- 
able approximation owing to the high single-qubit Clifford-gate fidelities (98%) 
compared to the measured state fidelities (85%-89%). The state tomography 
experiment was performed in parallel with the fidelity experiments described 
above and a Ramsey experiment that was used to calibrate the frequency actively. 
Error analysis. Error analysis was performed using a Monte Carlo method by assum- 
ing a multinomial distribution for the measured two-spin probabilities and a bino- 
mial distribution for the probabilities P|, P2 and P; used to calculated the fidelities. 
Values from these distributions were sampled randomly and the procedures out- 
lined above were followed. This was repeated 250 times to build up the final dis- 
tributions that we used to determine the mean values and the standard deviation. 
Simulation of two electron spins in a double quantum dot. In the simulation, 
we consider two electrons in two tunnel-coupled quantum dots, with an external 
magnetic field By applied to both dots. In addition to this field, the two dots have 
different Zeeman energies owing to the magnetic-field gradient across the double 
quantum dot that is generated by the micromagnets. The Zeeman energy of Q1 
(Q2) is denoted as Ez; (Ezz). The double-quantum-dot system is modelled by the 
Hamiltonian* 


-B O 0 0 0 0 
0 —-Av 0 0 t t 
H= 0 0 Av 0 -t —t 
0 0 0 6 O 0 
0 t -t 0 Ute 0 
0 t -t 0 0 Us-«e 


with the states |00), |01), |10), |11), S(2, 0) and S(O, 2) as the eigenbasis. In this 
Hamiltonian, 3 = (Ez) + Ez2)/2, Av=(Ez — Ez2)/2, /2t is the tunnel coupling 
between the (1, 1) and (0, 2)/(2, 0) singlet states, and U; is the on-site charging 
energy of the ith quantum dot. To study the phases of the qubits during the control 
pulses, the Hamiltonian is transformed into a rotating frame using 


H =VHV'+ih(0,V)Vt (4) 


where V= exp{ — i[Ezi(6, ® 1) + Ex(1@ 6,)]t} is the matrix that describes 
the unitary transformation normalized such that h= 1. The transformed 
Hamiltonian is 


0 0 0 0 0 0 
0 0 0 0 teiAvt teiAvt 
y- 0 0 0 0 —te iAvt te iAvt 
~ 10 0 0 0 0 0 
0 te iA" te” 0 Ute 0 
0 teiAv —teiAv g 0 Un,-—e 


To model the single-qubit gates during EDSR, we use the Hamiltonian 


Fw => Baw cos(wxt + $,)(G @I+1@ 6) 
k 


which assumes the same drive amplitude on each of the qubits. Here, k repre- 
sents the kth signal, which has an angular frequency w,, phase y; and driving 
amplitude Byw;x. This Hamiltonian is transformed into the rotating frame using 
equation (4), and the rotating-wave approximation can be made to remove the 
fast-driving elements because the Rabi frequency is much smaller than the Larmor 
precession. Doing so gives the Hamiltonian 


0 Qyeihe — Qeidvrt 0 00 

Die Av 0 0 QpeiAw2t 00 
Baw = 57 | Meee 0 0 Qi 0 0 
k 0 Dreier! Dre Av 0 00 

0 0 0 0 00 

0 0 0 0 00 


where (2, = Baw,xe'#, 2, is the complex conjugate of (2, and Aw, = wx — Waubit;- 

The dynamics of the two-qubit system can be described by the Schrédinger-von 
Neumann equation, p,, ,,=e 7 #/R el#ft/h, which we solved numerically using 
the Armadillo linear algebra library in C++, with the matrix exponentials solved 
using scaling methods (e4 = [], e4/2°) anda Taylor expansion. In the experiments, 
we apply microwave pulses with square envelopes that have a finite rise time owing 
to the limited bandwidth of the in-phase (I) and quadrature (Q) modulation chan- 
nels of the microwave vector source. For simplicity, we approximate these micro- 
wave pulses with a perfect square envelope. The detuning pulses were modelled 
with a finite rise and fall time using a Fermi-Dirac function to take (a)diabatic 
effects into account. The finite rise time was set to 2ns on the basis of the cut-off 
frequency of the low-pass filter that was attached to the lines and used to pulse the 
detuning pulses. 
Modelling noise in the simulation. In the model we include three different noise 
sources. The first two noise sources are from fluctuating nuclear spins in the natural 
silicon quantum well, which generate quasi-static magnetic noise that couples to 
the qubits via the Z ® Jand I ® Z terms in the Hamiltonian. These fluctuations 
are treated as two independent noise sources because D1 and D2 are in different 
locations in the quantum well and sample the field from different nuclear spins. 
The third noise source is charge noise, which can couple to the qubits via the 
magnetic-field gradient from the micromagnets. We model this noise as magnetic 
noise on the Z ® Jand I ® Z terms in the Hamiltonian. Charge noise also couples 
to the spins via the exchange coupling, which leads to noise on the Z ® Z term in 
the Hamiltonian. 

In our simulations, we treat these noise sources as quasi-static, whereby the 
noise is static within each cycle and changes only between measurement cycles. 
This approximation is reasonable because the noise in the system is pink, with 
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low frequencies in the power spectrum being more pronounced””. The static 
noise due to each noise source was modelled by sampling a random value from a 
Gaussian distribution with standard deviation o, corresponding to the contribu- 
tion to dephasing of that noise process. After sampling the static noise, the time 
evolution of the qubits during a gate sequence was calculated. This time evolution 
was averaged over many repetitions to give the final result; for each repetition new 
values for the static noise were sampled. In total, for each simulation we performed 
5,000 repetitions to ensure convergence. 

In the experiment, single-qubit gates are performed at higher detuning near the 
centre of the (1, 1) charge region at a detuning of ¢ = —3 meV, where the exchange 
is low (Jo#= 270 kHz); a two-qubit CZ gate is performed by pulsing to low detuning 
€=—0.7 meV, where the exchange is high (Jon = 6 MHz). To estimate the relative 
effect of charge noise on the Z ®@ I, I ® Zand Z ® Z terms at these two detuning 
points, we use the spectroscopy data for the qubits as a function of detuning energy 
shown in Extended Data Fig. 8. The four resonances observed correspond to the 
four transitions shown in Extended Data Fig. 8c between the |00), |01), |10) and 
|11) eigenstates. From fits of this data we estimate the derivative of the transition 
energy from state |i) to |j) at a particular detuning, dE);)_.|;)/de|-, which is directly 
proportional to the magnitude of fluctuations in the transition energy under the 
influence of charge noise. Fixing the energy of the |00) state, from these derivatives 
we calculate the relative noise levels on the other energy eigenstates: 


0 
OE\o0) |01) 
Oe 


Ble) = OE \o0) + |10) 


Oe 


(5) 
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In the regime in which J < Av, the Hamiltonian of the system can be approxi- 
mated as H= —Ez7(Z ® I) — Ex(I® Z)+J(Z @® Z) — J/4(1 @ J). The relative noise 
on Ez, Ez2 and J is found by decomposing the four noise levels in equation (5) in 
terms of the basis (—Z ® I, -I ® Z, Z ® Z, —I ® I/4) by calculating A~'B(e), where 


—1/2 -1/2 1/4 —-1/4 
—1/2 1/2 -1/4 —1/4 
1/2 -1/2 -1/4 -1/4 
1/2 1/2 1/4 —1/4 


A= 


We estimate the relative composition of the noise for (Ez1, Ez, J) ate =—3 meV 
to be (0.12, 0.24, 0) and at e = —0.7 meV (J=6 MHz) to be (0.61, 0.23, 0.26). Note 
that this is a crude approximation because we take into account only voltage noise 
along the detuning axis, whereas in reality charge noise acts also along other axes. 
We do not include calibration errors in the simulation. On the basis of the AIIXY 
and Ramsey calibration experiments (Supplementary Information, section S2), 
miscalibrations of a few per cent are possible. 

Estimating charge noise from the decay of the decoupled CZ oscillations. The 
dephasing due to charge noise coupling into the double-quantum-dot system via 
the exchange energy is measured by varying the duration of the DCZ gate between 
two 1/2 pulses on Q1 (Extended Data Fig. 9) for J=6 MHz. The DCZ gate removes 
the effect of quasi-static noise on the Z ® Jand I ® Z terms in the Hamiltonian, and 
the decay time of the oscillations T> = 1,640 ns is assumed to be due to noise on the 
Z® Zterm. The data are fitted using either a Gaussian (black line) or exponential 
decay (red line). The exponential decay seems to fit better to the data, which sug- 
gests that either higher-frequency noise plays a part® or the origin of the noise is 
from a few two-level fluctuators*°. Because the decay time of the DCZ gate is longer 
than that of the CZ gate, there is also a substantial quasi-static noise contribution. 


LETTER 


For simplicity, we include only the quasi-static contribution in our noise model. For 
Gaussian quasi-static noise with standard deviation o-, the decay time is 


The factor of 1/2 is needed because it is the noise on J/2 that contributes to the 
decay. This is because the target qubit precesses with a frequency of J/2 (ignoring 
the 1 ® Zand Z @ I terms) when the control qubit is in an eigenstate. From the 
dephasing time and dJ/de|-= 1.0 x 10~* extracted from Extended Data Fig. 8a, b, 
we estimate the charge noise on the detuning to be 11 j1eV. The data in Extended 
Data Fig. 9 used to extract this value were collected over about 40 min with no 
active calibration on the detuning pulse. The time needed for each single-shot 
measurement was around 10 ms. 

Simulations of the two-qubit algorithms. To describe the double-quantum-dot 
system used in the experiment, we use the following parameters in the 
Hamiltonian. The qubit frequencies were chosen to be Ez;/h = 18.4 GHz and 
Ez)/h= 19.7 GHz, and the on-site charging energies to be U; = U;=3.5 meV, com- 
parable to the experimental values. The tunnel coupling was chosen to be 
t=210MHz so that the residual exchange energy Jorp was 300 kHz, similar to that 
measured in the experiment. The two-qubit gates were implemented by choosing 
a value of ¢ for which J= 6 MHz when diagonalizing the Hamiltonian H. 

The results of the simulations for the Deutsch-Josza algorithm and the Grover 
algorithm using the CZ gate and the DCZ gate are shown in Fig. 3 and Extended 
Data Fig. 10. The amplitudes for the three noise sources used in the simulations 
were identical for all 16 panels. The value of charge noise used was 11 jteV (see 
above), whereas the nuclear-spin noise for Q1 and Q2 was chosen to give the 
single-qubit decoherence times T5 = 1,000 ns and T; = 600 ns measured in the 
Ramsey experiment (Extended Data Fig. 3). This gave a dephasing time of Q1 (Q2) 
due to nuclear spin of T3ny¢ = 1,200 ns (T3ny¢ = 800 ns). The simulations repro- 
duce many of the features of the experimental data for the algorithms. 

By simulating the algorithms, we learn that the residual exchange coupling 
Joe during single-qubit gates has little effect (less than 2%) on the result of the 
algorithms. Furthermore, we find that without noise on the single-qubit terms 
it is difficult to get consistent agreement with the data. Additional noise on 
the coupling strength improves the agreement. In contrast to the cases of the 
Deutsch—Jozsa algorithm and the conventional Grover algorithm, the simulation 
for the decoupled version of Grover’s algorithm predicts a better outcome than 
the experiment. This case uses the longest sequence of operations, leaving the 
most room for discrepancies between model and experiment to build up. These 
discrepancies could have several origins: (i) the implementation of the static noise 
model not being accurate enough; (ii) non-static noise having a role; (iii) the cali- 
bration errors in the gates that were left out of the simulation; or (iv) variations in 
the qubit parameters and noise levels between experiments. Finally, we note that 
initialization and readout errors are not taken into account in the simulations. 
Because initialization errors are negligible and the data shown were renormalized 
to remove the effect of readout errors, the simulated and experimental results can 
be compared directly. 

Data availability. The raw data and analysis that support the findings of this study 
are available in the Zenodo repository (https://doi.org/10.5281/zenodo.1135014). 
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Extended Data Figure 1 | Schematic of the measurement set-up. The 
sample was bonded to a printed circuit board (PCB) mounted onto 

the mixing chamber of a dilution refrigerator. All measurements were 
performed at the base temperature of the fridge, Thase* 20 mK. DC 
voltages were applied to all of the gate electrodes using room-temperature 
digital-to-analogue converters via filtered lines (not shown). Voltage 
pulses were applied to plunger gates P1 and P2 using a Tektronix 5014C 
arbitrary waveform generator (AWG) with 1-GHz clock rate. The signals 
from the AWGs passed through a room-temperature low-pass filter and 
attenuators at different stages of the fridge and were added to the DC 
signals via bias tees mounted on the PCB. Two Keysight E8267D vector 
microwave sources, MW1 and MW2, were used to apply microwaves 
(18-20 GHz) to perform EDSR on Q]1 and Q2, respectively. The signals 
passed through room-temperature DC blocks and custom-built 15-GHz 
high-pass filters and attenuators at different stages of the fridge and were 
added to the DC signals via bias tees mounted on the PCB. The output 

of the microwave source (phase, frequency, amplitude and duration) was 
controlled with I/Q vector modulation. The I/Q signals were generated 
with another Tektronix 5041C, which was the master device for the entire 
set-up and provided trigger signals for the other devices. In addition to the 
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vector modulation, we used pulse modulation to give an on/off microwave 
power output ratio of 120 dB. Although I/Q modulation can be used to 
output multiple frequencies, the bandwidth of the AWG was not sufficient 
to control both qubits with one microwave source owing to their large 
separation in frequency (1.3 GHz). The sensor current I was converted 

to a voltage signal with a custom-built preamplifier, and an isolation 
amplifier was used to separate the signal ground, with the measurement 
equipment grounded to reduce interference. Following this, a 20-kHz 
Bessel low-pass filter was applied to the signal using a SIM965 analogue 
filter. A field-programmable gate array (FPGA) analysed the voltage signal 
during the readout and assigned the trace to be spin-up if the voltage fell 
below a certain threshold. The voltage signal could also be measured with 
a digitizer card in the computer. The shapes of the pulses generated by the 
AWGs and microwave sources during qubit manipulation, along with the 
typical timescales, are shown in the lower right. Square pulses were used 
to perform the CZ gate and as the input for the I/Q modulation to generate 
microwave pulses. The pulse modulation was turned on 40 ns before 
turning on the I/Q signal, owing to the time needed for the modulation to 
switch on. 
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Extended Data Figure 2 | Measurement protocol for two electron spins. 
a, Stability diagram of the double quantum dot, showing the positions in 
gate space used to perform single-qubit gates (red circle) and two-qubit 
gates (yellow circle). The map shows the numerically obtained differential 
current dJ/dVp, through the quantum-dot charge sensor as a function of 
the voltages on P1 and P2. The white dashed line is the (1, 1)-(0,2) inter- 
dot transition line. The white arrow indicates the detuning axis € used in 
the experiments. Although the detuning pulse for the two-qubit gate 
crosses the charge-addition lines of D1 and D2, the quantum dots remain 
in the (1, 1) charge state because the pulse time is much shorter than the 
electron tunnel times to the reservoirs. b, Plot of the voltage pulses applied 
to plunger gates P1 and P2 and the response of the quantum-dot charge 
sensor over one measurement cycle. D2 is unloaded by pulsing into the 

(1, 0) charge region for 1.5 ms (purple circle). The electron on D1 is 
initialized to spin-down by pulsing to a spin-relaxation hotspot at the 

(1, 0) and (0, 1) charge degeneracies (orange circle) for 501s (see Extended 
Data Fig. 5). D2 is loaded with a spin-down electron by pulsing to the 
readout position for 4ms (blue circle). During manipulation, the voltages 


Sequence time (ms) 


on the plunger gates are pulsed to the red circle for single-qubit gates and 
to the yellow circle for two-qubit gates, where the exchange is about 

6 MHz. After manipulation, the spin of the electron on D2 is measured by 
pulsing to the readout position (blue circle) for 0.7 ms, where the Fermi 
level of the reservoir is between the spin-up and spin-down 
electrochemical potentials of D2. If the electron is spin-up then it can 
tunnel out, after which a spin-down electron tunnels back in. These two 
tunnel events are detected by the quantum-dot sensor as a single blip in 
the current signal. An additional 1.3 ms is spent at the readout position so 
that D2 is initialized to spin-down with high fidelity. Following this, Q1 is 
measured by first performing a CROT gate at the yellow circle so that 


a|00) + 3]10) “2 ajo0) + B|11), where CROT12 indicates a CROT gate 
with Q1 as the control and Q2 as the target. A projective measurement of 
Q1 is then performed by measuring Q2 at the readout position for 0.7 ms 
(blue circle). Finally, we add a compensation pulse to VP1 and VP2 so that 
over the measurement cycle Vpc=0 to mitigate charging effects in the bias 
tees. c, Close-up of the stability diagram in a showing the positions in gate 


space used for initialization and readout. 
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Extended Data Figure 3 | Single-qubit properties and two-axis control. 
The purple (top) and orange (bottom) data points correspond to 
measurements performed on Q1 and Q2, respectively, in the (1, 1) regime 
(red circle in Extended Data Fig. 2). a, Spin-up fraction as a function of the 
microwave frequency of an applied 1 pulse, showing a resonant frequency 
of 18.387 GHz (19.670 GHz) for Q1 (Q2). b, The spin-relaxation time is 
measured by preparing the qubit to spin-up and varying the wait time 
before readout. From the exponential decay in the spin-up probability we 
measure T, > 50 ms (T; = 3.7 £0.5 ms) for Q1 (Q2). ¢, Spin-up probability 
as a function of microwave duration, showing Rabi oscillations of 2.0 MHz 
for Q1 and Q2. d, The dephasing time is measured by applying a Ramsey 
pulse sequence and varying the free evolution time rT. Oscillations were 
added artificially to improve the fit of the decay by making the phase of the 


last microwave pulse dependent on the free evolution time, @ = sin(w7), 
where w= 4 MHz. By fitting the data with a Gaussian decay, 

Pj) x exp[—(r/T3)?] sin(w7), we extract T} = 1.0+0.1 1s 

(T; =0.6+0.1 1s) for Q1 (Q2). In the measurement for Q1, the first 7/2 
microwave pulse is a Y gate. The Ramsey measurement was performed 
over about 20 min with the frequency calibrated every approximately 

1 min. e, The coherence time of Q1 (Q2) can be extended to 

T2Hahn = 19 + 3 ts (T2Hahn = 7 + 11s) by a Hahn echo sequence. The 
coherence time is extracted from an exponential fit to the spin-up 
probability as a function of the free evolution time in the Hahn echo 
sequence. f, Full two-axis control is demonstrated by applying two 1/2 
pulses and varying the phase of the second one. All error bars are 1a from 
the mean. 
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Extended Data Figure 4 | Randomized benchmarking of single-qubit 
gates. Randomized benchmarking of the single-qubit gates for each qubit 
is performed by applying a randomized sequence of a varying number of 
Clifford gates m to either the |1) or |0) state and measuring the final 
spin-up probability P/,) or P\1), respectively. All gates in the Clifford group 
are decomposed into gates from the set {I, +X, +X’, +Y, +Y*}. The purple 
(orange) data points show the difference in the spin-up probabilities 

P41) — Pj1) for Q1 (Q2) as a function of sequence length. For each sequence 
length m we average over 32 different randomized sequences. From an 
exponential fit (solid lines) of the data, P\) — P\1) = ap™, we estimate 
average Clifford-gate fidelities F- = 1 — (1 — p)/2 of 98.8% and 98.0% for 
Q1 and Q2, respectively. The last three data points from both datasets were 
omitted from the fits because they deviate from a single exponential”. All 
error bars are 1o from the mean. 
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Extended Data Figure 5 | Spin-relaxation hotspots used for high- 
fidelity initialization. a, Close-up stability diagram of the (1, 0)-(0, 1) 
charge transition. The white arrow defines the detuning axis between D1 
and D2 controlled with P1. b, Schematic of the energy-level diagram as 
a function of detuning for one electron spin in a double quantum dot. 
c, Spin-relaxation hotspots are measured by preparing the electron on D1 
to spin-up using EDSR, applying a voltage pulse along the detuning axis 
(white arrow in a) for a wait time of 200 ns and performing readout of 
the electron spin. We observe three dips in the spin-up probability, which 
correspond to spin-relaxation hot spots. The first and third hotspot are 
due to anticrossings between the (0, |) and (1, 0) states and the (|, 0) and 
(0, T) states”*, The second hotspot occurs at zero detuning. The voltage 
separation between the first and third hot spot corresponds to the sum of 
the Zeeman energy of D1 and D2 divided by the gate lever arm a along 
the detuning axis. Knowing precisely the Zeeman energies from EDSR 
spectroscopy, we can accurately extract the gate lever arm to be a=0.09e. 
d, The spin-relaxation time at zero detuning (orange circle in a) is found 
to be T; = 220 ns by measuring the exponential decay of the spin-up 
probability as a function of wait time 7 at zero detuning. All error bars are 
lo from the mean. 
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Extended Data Figure 6 | Two-qubit CROT gate. a, Microwave 
spectroscopy of Q2 close to zero detuning between the (1, 1) and (0, 2) 
states (yellow dot in Extended Data Fig. 2a) with the exchange coupling on. 
The blue and red curves show the resonance of Q2 after preparing Q1 into 
spin-down and spin-up, respectively. The resonance frequency of Q2 shifts 
by the exchange coupling, and by applying a 1 pulse at one of these 
frequencies we can perform a CROT gate, which is equivalent to a CNOT 
gate up to aZ rotation. As discussed in the main text, this CROT gate is 
used to perform the projective measurement of Q1. All error bars are lo 
from the mean. 
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Extended Data Figure 7 | Measurement of Jo¢¢ using a decoupling 
sequence. The exchange coupling Jo, during single-qubit gates is measured 
using a two-qubit Hahn echo sequence, which cancels out any 
unconditional Z rotations during the free evolution time rT. Fitting the 
spin-up probability as a function of free evolution time 7 using the 
functional form sin(2TJog7), we extract Jor-—= 270 kHz. All error bars are lo 
from the mean. 
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Extended Data Figure 8 | Microwave spectroscopy of Q1 and Q2. fik = Ezjle) +(— 1)**1J(c), where j denotes the qubit and k denotes the state 
a, b, Spectroscopy of Q1 (a) and Q2 (b) versus detuning energy ¢ after of the other qubit. The data are fitted well using J(¢) « e“, Ezi(e) x e, 
initializing the other qubit to (|0) + |1))/./2. Towards e =0 there are two and Ez)(e) x e. The fitted Zeeman energies of Q1 and Q2 are shown by the 
resonances each for Q1 and Q2, which are separated by the exchange black lines. We observe that the Zeeman energy of Q1 has an exponential 
energy J(e)/h. As discussed, the Zeeman energy Ez(<) of Q1 and Q2 also dependence towards the (0, 2) charge regime (¢ = 0). This observation can 
depends on detuning because changes to the applied voltages shift the be explained by the electron delocalizing from D1 towards D2, which has a 
position of the electron in the magnetic-field gradient. The four resonance much higher Zeeman energy. c, Schematic of the colour-coded transitions 
frequencies are fitted (green, blue, red and yellow lines) with that correspond to the resonances in a and b. 
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Extended Data Figure 9 | Decay of the decoupled CZ oscillations. The 
normalized spin-up probability of Q1 as a function of the total duration 
time 27 of the two CZ gates in the DCZ sequence. The data are fitted using 
a sinusoid, P};) =0.5sin(27Jr) + 0.5, with either a Gaussian (black line; 

e~ @7/T2)”) or exponential (red line; e~27/”) decay. From these fits we find a 
decay time of T, = 1.6 1s. All error bars are 1o from the mean. 
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Extended Data Figure 10 | Simulation of the Deutsch-Josza and Grover 
search algorithms using the DCZ gate. a, b, Two-spin probabilities as a 
function of the sequence time during the Deutsch-Josza algorithm (a) and 
the Grover search algorithm (b) for each function, using the decoupled 
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version of the two-qubit CZ gate (the DCZ gate). The solid lines show the 
outcome of the simulations, which include decoherence due to quasi-static 
charge noise and nuclear-spin noise. All error bars are 1o from the mean. 
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Massive Dirac fermions in a ferromagnetic kagome 


metal 
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Aaron Bostwick, Eli Rotenberg’, David C. Bell**, Liang Fu, Riccardo Comin! & Joseph G. Checkelsky! 


The kagome lattice is a two-dimensional network of corner-sharing 
triangles! that is known to host exotic quantum magnetic states” *. 
Theoretical work has predicted that kagome lattices may also host 
Dirac electronic states® that could lead to topological® and Chern’ 
insulating phases, but these states have so far not been detected in 
experiments. Here we study the d-electron kagome metal Fe3Sn2, 
which is designed to support bulk massive Dirac fermions in 
the presence of ferromagnetic order. We observe a temperature- 
independent intrinsic anomalous Hall conductivity that persists 
above room temperature, which is suggestive of prominent Berry 
curvature from the time-reversal-symmetry-breaking electronic 
bands of the kagome plane. Using angle-resolved photoemission 
spectroscopy, we observe a pair of quasi-two-dimensional Dirac 
cones near the Fermi level with a mass gap of 30 millielectronvolts, 
which correspond to massive Dirac fermions that generate Berry- 
curvature-induced Hall conductivity. We show that this behaviour is 
a consequence of the underlying symmetry properties of the bilayer 
kagome lattice in the ferromagnetic state and the atomic spin-orbit 
coupling. This work provides evidence for a ferromagnetic kagome 
metal and an example of emergent topological electronic properties 
in a correlated electron system. Our results provide insight into the 
recent discoveries of exotic electronic behaviour in kagome-lattice 
antiferromagnets* !° and may enable lattice-model realizations of 
fractional topological quantum states!)!”, 

The kagome lattice (Fig. 1a) isa network with trihexagonal (3.6)? 
Archimedes tiling that has been studied extensively in the context of 
frustration-induced quantum-spin-liquid phases?~*. In terms of elec- 
tronic structure, simple tight-binding models on kagome lattices have 
long been known to yield unusual features, including dispersionless 
bands and Dirac points (Fig. 1b); the Dirac points appear in a man- 
ner similar to those in hexagonal graphene lattices'*. Although such 
features have not previously been observed in experiments, theo- 
retical interest has persisted and lead to several further predictions. 
Of particular interest are kagome networks in which time reversal 
symmetry is broken via ferromagnetism (Fig. 1c)*”!!, which has the 
effect of splitting the spin-degenerate Dirac bands (Fig. 1d). Further 
inclusion of spin-orbit coupling (Fig. le) yields various gapped phases 
(Fig. 1f) with integer’ or fractional’)? topological invariants (Chern 
numbers). When the chemical potential is within the Dirac gap, the 
intrinsic anomalous Hall effect, which results from the integration of 
Berry curvature over the Brillouin zone, is quantized and in principle 
affords detection of the Chern number of the wavefunction of the sys- 
tem'*'°. More generally, with multiple bands and arbitrary positions 
of the chemical potential, such measurements can detect the Berry 
curvature that is concentrated by massive Dirac bands'®. 

Despite a broad theoretical understanding of electronic Berry- 
phase effects in ferromagnetic kagome models and extensive studies 


of kagome insulators‘, experimental realization of the former has been 
challenging, in part owing to the relative rarity of kagome materials. 
An approach to realizing metallic kagome networks in the hexagonal 
transition-metal stannides A,Sny (A=Mn, Fe or Co; x:y= 3:1, 3:2 or 
1:1) has been reported!”. As shown in Fig. 1g for A = Fe (studied here), 
starting from a single layer of a hexagonal close-packed structure of 
iron atoms, a kagome net emerges naturally by replacing a 2 x 2 sub- 
lattice (dashed cell) with tin atoms, resulting in an Fe3Sn plane with an 
underlying iron kagome lattice. 

_Here we study the bilayer kagome compound Fe3Sn, (space group 
R3m; hexagonal lattice constants a= 5.338 A and c= 19.789 A)—a 
structural variation of Fe,Sn, that includes a stanene layer sandwiched 
between Fe;3Sn bilayers (Fig. 1h). In Fig. 1h we also show a correspond- 
ing transmission electron microscopy image of a (1010) cross-section 
ofa single crystal of Fe3Sn, which reveals the Fe3Sn and stanene layers. 
Previous studies!” have identified Fe;Sn, as an unusual magnetic con- 
ductor with a high Curie temperature of Tc =670 K. Although attention 
was originally focused on the zero-field spin structure'®, recent studies 
have focused on the formation of skyrmion bubbles’? and a substantial 
anomalous Hall effect”! at finite field. The latter is particularly inter- 
esting in comparison with the structurally related antiferromagnets 
Mn3Sn and Mn3Ge, which were recently reported to have a large 
room-temperature anomalous Hall response®? and possible Weyl 
fermion states’. 

Measurements of magnetization M as a function of magnetic induc- 
tion B along the c axis (Fig. 2a) demonstrate that the system is a soft 
ferromagnet, with the saturation field and saturation magnetization M, 
depending mildly on temperature T. The saturation magnetization M, 
reaches approximately 1.9/1g per iron atom at low temperature T (where 
[Jz is the Bohr magneton; Fig. 2a, inset). The crystals exhibit high 
metallicity, with the residual resistivity ratio of (300 K)/p(2 K) +25 
(Extended Data Fig. 1) allowing characterization by electrical transport. 
The transverse resistivity in the kagome plane p,,(B) (Fig. 2b) strongly 
reflects M(B)—a characteristic of the anomalous Hall effect?°!. In fer- 
romagnetic conductors it is conventional!® to express Pyx(B) in terms 
of contributions from the ordinary (Lorentz-force) Hall coefficient Ro 
and the anomalous Hall coefficient Rs: py,=RoB + R.M; as shown in 
the inset of Fig. 2b, Ry depends mildly on T (corresponding to 6 x 107! 
electrons per cm? at low T), whereas R, is much larger but decreases 
with decreasing T. 

To elucidate the role of the bilayer kagome lattice further, we examine 
the associated Hall conductivities in the kagome plane. The contribu- 
tions to the total Hall conductivity Oxy = a + Os where the super- 
script ‘N’ (A) denotes the normal (anomalous) component, can be 
separated by using the field linearity of a in the low-Hall-angle limit 
(Methods). Although o is known to have contributions of both intrin- 
sic (Berry curvature) and extrinsic (scattering) origin’®, it has recently 
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Figure 1 | The kagome structure and Fe3Snp. a, b, Structure of the 
kagome lattice (a) and the associated Dirac point in the nearest-neighbour 
tight-binding model (b), with the Brillouin zone shown in the inset. The 
band is degenerate, as denoted with red and blue spins. c, d, Ferromagnetic 
kagome lattice with broken time-reversal symmetry (moments in blue) (c) 
and the associated spin-polarized Dirac band with coupling between the 
magnetization and spin (d). e, f, Spin-orbit-coupled ferromagnetic 
kagome lattice with Berry phase ¢ accrued via hopping (e) and the 
corresponding gapped Dirac spectrum (f). g, The Fe;Sn kagome plane in 
Fe3Sno, with the kagome network shown in red. h, Transmission electron 
microscopy cross-section of Fe3Sn and the corresponding Fe3Sn and 
stanene layers viewed from the [1010] direction. 


been demonstrated that the insensitivity of the latter to thermal excita- 

tions allows the parameterization a =f (G0) 02 et Oy where f(x.) 

is a function of the residual conductivity 04,0, xx is the conductivity 

and o'™ is the intrinsic anomalous Hall conductivity”. Because 

does not depend on the scattering rate, in a system with substantial 
int ; 


Berry curvature, 0 is then the remnant o that is observed as o, 0 
(Fig. 2c, top inset). Figure 2c demonstrates that ot) remains rela- 
tively unchanged from this remnant value at high temperature, until 
T 100 K at which an upturn concomitant with increasing o,,(T) is 
observed. This upturn is indicative of the onset of an extrinsic response, 
which is expected’* with the longer relaxation time in this range of 043 
the subsequent o%, scaling of the additional 74 (Fig. 2c, top inset) is 
also consistent with an extrinsic origin’””*. The scattering-rate- 
independent value of oy at high T persists, varying by about 10% down 
to T=2K (158+16 Q-!cm}), which corresponds to approximately 
0.27e"/h per kagome bilayer, where e is the electronic charge and h is 
the Planck constant. We identify this contribution as a" (Fig. 2c), with 
behaviour akin to that expected from a massive Dirac band”, 

These observations point to a substantial Berry-curvature contri- 
bution to the transport response in Fe3Sn in a geometry that samples 
the kagome planes (Fig. 2c, middle inset). We also measured the 
Hall response perpendicular to the kagome plane o-, (Fig. 2c, lower 
inset). This out-of-plane signal is much smaller (Fig. 2c), with 
the ratio |o4 / t being less than 10% at the highest T, indicating a 
large relative enhancement of the Berry curvature in the kagome 
plane. 
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To examine the origin of this Hall response further, we measured 
the electronic structure of Fe3Snz by using angle-resolved photoemis- 
sion spectroscopy (ARPES). In Fig. 3a, b we show the experimen- 
tal Fermi surface and energy-momentum dispersion, respectively, 
of the electronic bands along high-symmetry directions parallel to 
the kagome planes, measured at T= 20 K (see also Extended Data 
Fig. 2). A rich spectrum of electronic excitations with hexagonal 
symmetry is observed, consistent with the metallicity and crystal- 
lographic structure described above. More notably, linearly dispers- 
ing Dirac cones are observed at the corner points K and K’ of the 
Brillouin zone. This spectrum, which is reminiscent of the electronic 
structure of graphene”, is the long-sought realization of kagome- 
derived Dirac fermions’. These Dirac-like bands are shown in detail 
in the high-resolution energy-momentum section of the ARPES data 
across the K point in Fig. 3c (data are collected along the blue dashed 
line in Fig. 3a and then symmetrized in momentum about K); two Dirac 
cones, separated in energy but centred at K, are resolved. Hereafter, we 
focus on these bands and their role in generating Berry curvature. 

The two-fold Dirac cones can also be identified in constant-energy 
contours (Fig. 3d). At the Fermi energy Ex (Fig. 3d, top layer), a pair of 
Dirac cones forms two electron pockets centred at K: a circular inner 
pocket and a trigonally warped outer pocket. Moving down from Er 
each pocket shrinks, forming apparent Dirac points at binding energies 
of 70 meV (Fig. 3d, second layer) and 180 meV (Fig. 3d, bottom layer). 
At the midpoint energy (125 meV), the two Dirac cones cross and, 
within our experimental resolution, form a ring of Dirac points in the 
x-y momentum plane. The experimental electronic structure near the 
K point is therefore characterized by two energy-split (AE= 110 meV) 
interpenetrating Dirac cones. This splitting is a natural consequence 
of the bilayer kagome structure, similarly to the case of multilayer 
graphene”*, whereas the exchange splitting due to magnetic order 
is expected”* to be much larger (in excess of 2eV). Photon-energy- 
dependent ARPES (Extended Data Fig. 4) reveals negligible variation 
in the Dirac bands as a function of out-of-plane momentum k,, indi- 
cating quasi-two-dimensional (quasi-2D) bands confined to the iron 
kagome bilayer. 

Having established the Dirac fermiology of Fe3Sn2, we focus on 
the role of spin-orbit coupling and the possible mass acquisition 
of the Dirac bands. Inspection of the raw ARPES data reveals that 
the spectral intensity at the Dirac point is suppressed substantially 
(Extended Data Fig. 3d), which is more clearly visualized in the second 
derivative of the ARPES map (Fig. 3e). Analysis of the energy distri- 
bution curves displayed in Fig. 3f reveals a break between the upper 
and lower branches of the Dirac cone, which signals the opening of an 
energy gap A. A quantitative analysis performed by fitting the energy 
distribution curves with Gaussian peaks returns A=30+5 meV 
(Methods). This value is similar to that predicted previously for spin- 
orbit-coupled 3d transition metals in kagome lattices’, but smaller 
than that observed in magnetically doped topological insulators (about 
50meV)*°”’ and in hydrogen-decorated graphene (at least 0.5eV)**. 

The emergence of massive Dirac fermions in Fe3Sn, can be under- 
stood as a combination of ferromagnetic splitting and spin-orbit cou- 
pling in the underlying kagome geometry. Motivated by the weak k, 
dispersion observed in ARPES, we consider a stacked system of kagome 
layers. In Fig. 4a we show a perfect Fe;Sn kagome layer and the corre- 
sponding Brillouin zone. The kagome layer has two-fold and three-fold 
rotational symmetries (C2, and C3,, respectively) that leave the K and 
K’ points invariant and thus form point group D3. In the absence of 
spin-orbit coupling, the two-fold-degenerate crossing (Dirac) points 
at K and K’ belong to a two-fold irreducible representation (E) and 
are therefore protected. As illustrated in Fig. 4b, a Dirac crossing can 
be observed at K ina tight-binding model Hx for nearest-neighbour 
hopping on the kagome sites: 
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Figure 2 | Anomalous Hall response of Fe3Sn. a, Magnetization M of 
Fe3Sn, along the c axis as a function of magnetic induction B at room 
temperature (T = 300K; orange) and low temperature (T = 2 K; black). The 
inset shows the saturation magnetization M, (measured at 2T) asa 
function of temperature T. b, Hall resistivity p,, as a function of B. The 
inset shows the ordinary and anomalous Hall coefficients Ro (black) and R, 
(purple), respectively, as a function of T. c, Anomalous Hall conductivities 


where (ij) indexes nearest-neighbour pairs, t is the hopping integral 
and ¢; (c!) is the fermion annihilation (creation) operator, which is 
taken to be spin-polarized owing to exchange. The kagome bilayers in 
Fe3Sn) (Fig. 4c) are tiled by triangles of two different bond lengths, 
2.59 A and 2.75A, as indicated by the red and blue shading. However, 
the combined unit of these kagome layers and the intervening stanene 
layer preserves the {C,, C3,} symmetry of the perfect kagome lattice 
and the Dirac points are thus protected by crystal symmetry in the 
absence of spin-orbit coupling. The additional layer degree of freedom 
further enriches the electronic structure. In particular, the ABA layer 
stacking of the structure in Fig. 4c gives rise to bonding-antibonding 
splitting”’, as seen in a simple tight-binding model with this additional 
hopping (Fig. 4d). 

We next introduce Kane-Mele-type spin-orbit coupling Hgoy to the 
tight-binding model Hx, with 


Asor= oy Ai(CHE;— NEIL) (2) 
ij 

where Aj represents the effect of spin-orbit coupling and } and 
| denote the spin quantum number””. Writing j= \(Ei x Rij) - s, 
where \ is the spin-orbit coupling constant, E is the electric field 
on the hopping path, R is the hopping vector and s represents the 
electron spin, for spin-polarized bands near K and K’ with non-zero 
z polarization sz, Hsoy effectively reduces to the Haldane term?!. 
Accordingly, for the single-layer case (Fig. 4b), when Ex is positioned 
in the Dirac gap, the system enters a Chern insulating phase with 
quantized anomalous Hall conductance”™!. 
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on, (red, left axis) and of (black, left axis) in the x-y and z-x planes, 
respectively, along with the longitudinal conductivity o,, (blue, right axis) 
and estimated intrinsic Hall conductivity o'*' (orange, left axis). The 
measurement configurations for o,, (top) and a, (bottom) are shown in 
the lower inset; I represents the charge current. The upper inset shows a 
plotted against o2,; the solid and dashed lines are the scaling curves 

(see text). 


To connect with the Hall response, we construct a k - p Hamiltonian 
near K and K’ for the dual Dirac fermions and fit to the ARPES data 
(Fig. 4e, inset; Methods). We then calculate the contribution of the 
massive Dirac bands to the Hall response by integrating the Berry 
curvature of the filled states, which yields o@!° = (0.31£0.05)e?/hat 
Ey for a kagome bilayer (Fig. 4e), comparable to the observed value 
of oi = (0.27+0.03)e?/h per bilayer. Remarkably, despite the 
simplicity of our model, the action of the quasi-2D massive Dirac 
fermions at K and K’ largely accounts for the observed Hall response 
with the crystal viewed as a parallel network of bilayer kagome planes. 
However, there are limitations; for example, for a 2D model there is 
no contribution to the out-of-plane Hall response. We suggest instead 
that this out-of-plane response originates from the three-dimensional 
(3D) network of tin atoms and the associated k,-dispersive bands 
near I’ (Methods). The relative smallness of of. is then consistent 
with a minor contribution of tin-atom-derived bands to the overall 
Berry curvature. More generally, the model evidences the role of the 
concentration of Berry curvature in the quasi-2D massive Dirac 
bands, which have small Ep comparable to the spin-orbit coupling 
strength'®. The robustness of the Hall response observed here is 
comparable to that of the Hall response that is driven by chiral 
antiferromagnetic order in Mn3Sn (ref. 8) and Mn3Ge (ref. 9); 
however, instead of originating from 3D Weyl nodes"®, the Hall 
response observed here is driven by quasi-2D Dirac fermions in a 
ferromagnetic kagome network interleaved with stanene layers. 

By combining electrical transport measurements, ARPES and 
theoretical analysis, this study provides a comprehensive proof of 
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Figure 3 | Massive Dirac fermion at the zone corner of Fe3Sn). 

a, Experimentally obtained Fermi surface of Fe3Sn. The hexagonal 
Brillouin zone (red dashed lines) and high-symmetry points (black dots) 
are marked. b, Experimentally determined band dispersion of Fe3Sn; along 
the high-symmetry directions. c, High-resolution ARPES data, measured 
along the blue dashed line in a and then symmetrized with respect to K. 
The complete band dispersion is shown in Extended Data Fig. 3 (with a 
modified colour scale). d, Constant-energy maps at binding energies of 

0 meV, 70 meV, 125 meV and 180 meV. Two electron pockets (top layer), 


principle for engineering band-structure singularities and topolog- 
ical phenomena in correlated systems. In particular, we realize lat- 
tice-driven®”*°*! topological 3d electronic bands, which we suggest 
exhibit the defining properties of a ferromagnetic kagome metal. 
Viewed in isolation, the bands near K can be considered to exhibit 


LETTER 


E, - 125 


Intensity 


Second Di 


E, - 180 


y -0.2 -0.1 0 
Energy, E-E, (eV) 


k, (A), 


a first Dirac point (second layer), a Dirac circle (third layer) and a second 
Dirac point (bottom layer) can be clearly detected from the maps. The 
corresponding energies and Brillouin zone contours are marked with 
coloured dashed lines in c and d. e, f, The second-derivative plot (e) and 
the stack of energy distribution curves (f) across the Dirac points. Both 
panels share the momentum range and direction with c. The red double- 
headed arrow marks the discontinuity between the upper and lower 
branches of the Dirac cone. Coloured markers indicate the fitted band 
energies. All data were obtained using 92-eV photons. 


a 2D ‘Chern gap’—a time-reversal-symmetry-broken topologically 
non-trivial phase that is intrinsic to stoichiometric materials and has 
a dominant contribution to the electrical response at temperatures of 
up to 300K and above. To isolate these bands, as a step towards reali- 
zing high-temperature dissipationless modes", we propose finding 
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Figure 4 | Tight binding and hall conductivity of a kagome bilayer. 

a, Two-fold (C,) and three-fold (C3,) rotation-axis symmetry operations 
of a single Fe3Sn kagome layer. b, Tight-binding band model of the single- 
layer kagome lattice with (red) and without (blue) spin-orbit coupling. 
The inset shows a magnified view of the (avoided) crossing near K. c, Co, 
and C3, symmetries of the ‘breathing’ kagome (illustrated by the red and 
blue inequivalent bonds) and stanene layers. d, Tight-binding band model 
of the double-layer kagome lattice with in-plane hopping t and inter-plane 


hopping 0.3t, with (red) and without (blue) spin-orbit coupling. The 

inset shows a magnified view of the double Dirac structure near K. The 
spin-orbit coupling strength A= 0.05¢t for b and d. e, Anomalous Hall 
conductivity 0, as a function of Fermi energy Ex from the k - p model 
(black solid curve; see text). The red and blue dashed curves represent the 
contributions from the upper and lower Dirac bands, respectively. The 
inset shows the fit (solid lines) of a massive Dirac dispersion to the ARPES 
results (circles) near K. 
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simplified structures with a single kagome layer per unit cell or pur- 
suing 2D morphologies via thin films or exfoliation of related materi- 
als with van der Waals bonding. Furthermore, the frustrated hopping 
network of kagome lattices is predicted to support electronic bands 
(as in Fig. 4d) with non-zero Chern number and enhanced electronic 
correlation’. Searching for these bands by chemically doping Fe3Sno, 
by using alternative structures and stoichiometries in A,Sn, or by using 
other spin-orbit-coupled 4d and 5d kagome metals are key to realizing 
new, magnetically driven fractionalized phases of matter’»!”. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Single-crystal growth. Single crystals of Fe;Sn2 were grown using an I,-catalysed 
reaction. A stoichiometric ratio of Fe and Sn powders was sealed in a quartz tube 
with approximately 1% I, by mass and kept in a horizontal three-zone furnace 
with a temperature gradient from 750 °C to 650 °C for five weeks followed by 
water quenching to stabilize the Fe;Sn2 phase. Hexagonal, plate-like crystals of sub- 
millimetre size (Extended Data Fig. 1a, inset) formed near the high-temperature 
region as has been reported previously** for Fe;Ge. The hexagonal surfaces were 
confirmed as (0001) kagome planes by using single-crystal X-ray diffraction. 
Magnetization measurements. Magnetization measurements were performed 
using a commercial superconducting quantum interference device (SQUID) 
magnetometer with the field oriented along the c axis and in the a-b plane. 
Demagnetization corrections were performed for all measurements. The 
measured saturation moment (1.91 per Fe atom) is consistent with previous 
reports!®1.33, 

Transport measurements. Four probe transport measurements were performed 
for longitudinal and Hall resistivity in a commercial cryostat with a superconduct- 
ing magnet. High-field transport measurements in fields of up to 31 T were per- 
formed in a He-3 cryostat at Cell-9 of the National High Magnetic Field Laboratory. 
For measurements in the kagome plane, the field was applied along the [0001] 
direction with current and voltages in the kagome plane. For Hall measurements 
perpendicular to the kagome plane, the magnetic field and current were applied 
orthogonally in the kagome plane and the out-of-plane voltage is measured. The 
choice of coordination for in-plane and out-of-plane Hall measurements is such 
that, for the ordinary Hall effect of holes, a,,>0 when the field is applied along 
the +z direction and o,, > 0 (oxz< 0) when the field is applied along the +y 
direction. Electrical signals were detected using standard AC lock-in techniques 
with a typical current density of 10 A cm. To correct for contact misalignment, 
the measured longitudinal and transverse voltages were field-symmetrized and 
field-antisymmetrized, respectively. Demagnetization corrections were performed 
for all measurements. 

Scanning transmission electron microscopy (STEM). STEM experiments were 
conducted at a probe-corrected STEM (JEOL ARM) operated at an acceleration 
voltage of 200 kV. Fe3Sn2 samples were prepared by a Helios focused-ion beam 
(FEI), operated at an acceleration voltage of 30kV for the gallium beam during 
lift-out and of 2kV during polishing. Additional polishing was performed at 1kV 
and 0.5kV with a NanoMill (Fischione). At both acceleration voltages, samples 
were polished for 20 min on each side. 

Angle-resolved photoemission spectroscopy (ARPES). ARPES experiments were 
conducted at the Microscopic and Electronic Structure Observatory (MAESTRO) 
at beamline 7 (main data) and at the meV Resolution Soft X-ray Inelastic Scattering 
Beamline (MERLIN) at beamline 4 (preliminary measurement) of the Advanced 
Light Source. The two ARPES endstations are equipped with R4000 hemispherical 
electron analysers (VG scienta, Sweden). Data in Fig. 3 and Extended Data Fig. 3 
were collected at 20 K with a photon energy of 92 eV, which maximizes the ARPES 
spectral weight of the Dirac bands. The photon-energy-dependent measurement 
was conducted from 45 eV to 120 eV (Extended Data Fig. 4). Energy and momen- 
tum resolutions were better than 15 meV and 0.01 A |, respectively. Fe3Sn) samples 
were cleaved in the ultrahigh-vacuum chamber with a base pressure of better than 
4x 10! torr. All of the data were collected within 8h after cleaving to minimize 
the effect of sample degradation. Six different samples from various growth batches 
were analysed to confirm the consistency of results. 

Longitudinal electrical transport. The resistivity in the kagome plane of sample 
Cl asa function of temperature, p(T), is shown in Extended Data Fig. 1a. A metallic 
response is seen at all T, with a residual resistivity ratio of (300 K)/p(2 K) =25. 
The magnetoresistance for magnetic induction B normal to and within the kagome 
plane is shown in Extended Data Fig. 1b, c, respectively. In both figures, the electri- 
cal current I is perpendicular to the applied field. For B\| c (Extended Data Fig. 1b), 
we observe a non-monotonic response below the saturation field (B < 1.2T), 
which may reflect a transition through a skyrmion bubble phase, as was reported 
recently’’. This structure is absent for B | c (Extended Data Fig. 1c). In addition, 
for both Bc and B 1 c, a negative linear magnetoresistance characteristic of mag- 
non suppression™ is observed at high T. 

Analysis of the Hall effect. In a ferromagnetic metal, the total Hall conductivity 
is composed of contributions from the normal Hall effect (‘N’) induced by Lorentz 
force and the anomalous Hall effect (‘A’): a = oy + oe In the limit of small Hall 


angle (Oy = Py] Pax < 1), we have oyy B/(nep-.), where nis the carrier density 
and e is the electronic charge. Given the relatively small magnetoresistance, we 
extract oN from a linear fit of 0,,(B) for fields above saturation (1.7 T to 5T), with 
the intercept returning of (Extended Data Fig. 2a). That the condition Oy < lis 
satisfied is confirmed to high field, with Oy « 0.04 for fields up to 30 T (Extended 
Data Fig. 2b). 
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We measured the anomalous Hall effect at temperature T= 300 K in multiple 
samples. We find a consistently enhanced anomalous Hall conductivity in the 
kagome plane o% relative to out of the kagome planeo%.. For samples Cl, C4 and 
C5, the observed value of cS is 163.6Q-!cm=!, 179.10Q~-!cm™! and 
138.80 !cm7|, respectively. For samples C2, C4, C5 and C6, the observed value 
of a4, is 20.507!cm7!, 22.007!cem7!, 55.607!cm7! and 53.70>!cm=}, respec- 
tively. The T dependence for C1 and C4 is shown in Fig. 2. 

Energy-momentum dispersion along high-symmetry directions. Extended Data 
Fig. 3a, e shows the experimental Fermi surface of Fe3Sn2 obtained from different 
orientations of the crystal axis with respect to the photoelectron emission plane. 
Extended Data Fig. 3b-d, f, g shows the experimental band dispersion of Fe3Sn2 
along high-symmetry directions. Despite the occurrence of a complex-matrix-el- 
ements effect near the K point, the structure of the two interpenetrating Dirac 
cones is clearly visible in all momentum directions probed in this experiment. 
In Extended Data Fig. 3h, energy distribution curves at different Brillouin-zone 
corner points are shown. All energy distribution curves show a consistent two-peak 
structure near the first Dirac point, signalling a gap opening. The velocity of the 
Dirac fermions vp is found to be isotropic in the kagome plane, with magnitude 
Vp = (1.76 £0.11) x 10°m s~!, comparable to that observed recently in iron pnic- 
tide*° and selenide*® superconductors, but lower than in graphene*”*, possibly 
reflecting the correlated character of the Fe-3d states. 
Photon-energy-dependent ARPES. Photon-energy-dependent ARPES probes 
the variation of the band structure along the momentum direction perpendicular 
to the sample surface (the k, direction in our geometry)*’. The experiment here 
was conducted by varying the photon energy from 55eV to 120eV. The ARPES 
spectral weight of the localized d electrons is suppressed at low photon energy, 
so we report only the results from 90 eV to 120 eV. Considering the large c-axis 
lattice constant (c= 19.8 A), the photon energy range presented here spans more 
than three complete Brillouin zones. The dependence on photon energy varies 
substantially between different bands. For example, the dispersion of the innermost 
electron pocket at I (Extended Data Fig. 4b) depends critically on the photon 
energy (kz), whereas the dispersion of Dirac bands shows negligible dependence 
on k,. The latter finding demonstrates the pronounced 2D nature of the Dirac 
fermions that we observed. We ascribe the highly dispersive bands centred at I‘ to 
the 3D network of Sn atoms within the crystal structure. 

Tight-binding models. We use a simple 2D tight-binding model to capture the 
symmetry-protected Dirac nodes at K and K’ in the single-layer (Fig. 4a) and bilayer 
(Fig. 4c) kagome lattice. Starting with the former, in the absence of spin-orbit cou- 
pling, the nearest-neighbour tight-binding model Hx (equation (1)) yields Dirac 
bands and a dispersionless band (Fig. 4b). For the bilayer kagome lattice, we include 
additional hopping fo = 0.3t between the vertically displaced sites on each kagome 
layer, which introduces a layer splitting of the Dirac states in energy (Fig. 4d)’. 
We examine the effect of the spin-orbit interaction by adding a Kane-Mele-type 
term*” Ho; (equation (2)) in the leading-order nearest-neighbour hopping to 
the tight-binding model Hx with strength given by A(Ej x Rj) - s. The magnetic 
moments in Fe3Sn2 have been proposed to be subjected to spin reorientation and 
microscopic domain formation’***; contributions to Aj arise from orthogonality 
of the hopping path and local electric-field and spin directions in each domain. In 
the simplified hopping model, these contributions are represented by an in-plane 
electric field and s,, with j= 0.05¢ in Fig. 4b, d. Similar Hamiltonians*°“! have 
been used in the spin sector for insulating materials with kagome structures“?. 
Model calculation of Hall conductivity. To connect with the Hall response, we 
first construct a k - p Hamiltonian near K and K’ for the dual Dirac fermions 
observed in the ARPES spectra: 


Ap = [hv p(kxoy — kyox)] @ I+ Eotx + moz (3) 


where oj (i=x, y, z) are the Pauli matrices of pseudospin for each Dirac band, Eo 
is the energy splitting of the Dirac bands described by the Pauli matrix 7,, and 
m= A/2 is the Dirac mass. To obtain the band parameters, we fit the observed 
dispersion E(k) to the massive Dirac model E(k) =+.,/(hkvp)? + (A/2)? + Ei, 
where Ey is the energy offset of the upper (i= 1) and lower (i= 2) Dirac bands 
from Er. As shown in the inset of Fig. 4e, a satisfactory fit is found with 
vp = (1.85 +0.15) x 10°m s-!, A=32+3 meV, Ej =—73 + 5meV and 
E, = —182 + 5meV. The Dirac band centred at 73 meV can be analysed reliably, 
whereas matrix-element effects and the proximity of neighbouring bands interfere 
with the intensity distribution of the lower Dirac point. Applying the same set of 
Fermi velocity and mass for the upper and lower Dirac bands, we then calculate 
the contribution of the massive Dirac bands to the Hall response by integrating the 
Berry curvature over the filled states described by equation (3) as follows. 

The Hall conductivity 7, can be considered a geometric quantity that charac- 
terizes the mapping from the k,—k, momentum plane to the two-component 
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Dirac band structure**. For a general two-level Hamiltonian d(k) - 7, (where 
oO ={0x, Oy, 2} and d(k) is a generic vector), the wave functions may be 
represented as points on the unit Bloch sphere by +d(k) = d(k)/|d(k)|, where 
the + denotes the two eigenstates of the Hamiltonian at a given k. 7, then takes 
the form* 


e2 A a aS 
y= — ff d- (Ox x Ayd)dkydky (4) 
filled states 


where the integrand (the Berry curvature) can be seen as the Jacobian associated 
with mapping the k,—k, plane to the unit sphere. Therefore, 7, is proportional to 
the total area covered on the unit sphere by the filled states. Also, because the 
integrand in equation (4) is a pseudoscalar, the states represented by td have 
opposite contributions to oxy. 

The above formulation for o,, of a single Dirac fermion is illustrated in Extended 
Data Fig. 5 for the gapless and gapped cases. For the former (Extended Data 
Fig. 5a), the wavefunction of the Dirac fermions is confined to the equator of the 
Bloch sphere and therefore ,,=0. Extended Data Fig. 5b shows the case in which 
the lower branch of the massive Dirac cone is filled. In this case, the occupied states 
span the lower hemisphere and yield 0, = 0.5e7/h. When the upper branch of the 
gapped Dirac fermion is partially filled (Extended Data Fig. 5c), a contribution 
of opposite sign appears. The resulting o,, for a single Dirac fermion is therefore 


ae for S<2n 
Po h An 
xy 2 
ib ibeat for S>2n 
h 4n 


where S is the total area of the filled states on the Bloch sphere; this is shown in 
Extended Data Fig. 5d as a function of energy E normalized by hvz (where vr is 
the Fermi velocity). 

The above formulation is consistent with that expected from the semi-classical 
Boltzmann equation and Kubo formalism for a single massive Dirac fermion”: 


_ e2 A/2 
h (4/2)? + (hv eke)? 


(5) 


Oxy 


where A is the size of the Dirac gap and ky is the Fermi wavevector. From 
equation (5) and Extended Data Fig. 5c, we see that ox, is maximized to e”/(2h) 
when Ey is within the gap and drops outside the gap with a long tail**4”. 
Importantly, the massive Dirac fermions at the K and K’ valleys are related by 
inversion symmetry and therefore contribute similarly to the Berry curvature; the 
contribution from equation (5) is therefore doubled. With the experimental fits to 


equation (3) and adding up the contributions from the upper and lower Dirac 
points to the anomalous Hall conductivity described by equations (4) and (5), we 
obtain the energy-dependent 2D Hall conductance o (Fig. 4e). The Hall conduc- 
tivity at the Fermi level evaluates to 0,,(Ep) = (0.31 0.05)e/h. The uncertainty 
within this model arises from fitting the experimental band parameters near K; 
developing models that use the complete electronic structure and Berry curvature 
to compare to the experimental results is an important future direction. 

Data availability. The data that support the findings of this study are available 
from the corresponding authors on reasonable request. 
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Extended Data Figure 1 | Metallic transport in Fe3Snp. a, Resistivity b, c, Magnetoresistance (defined as MR= [fp xx(B) — pxx(0)]/pxx(0)) at 
pas a function of temperature T in the kagome plane for Fe3Sn2 selected T with B applied perpendicular (b) or parallel (c) to the kagome 
sample C1. The inset shows a photograph of Fe3Snz single crystals. plane and B | I (schematics of the configurations are shown as insets). 
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Extended Data Figure 2 | Extracting anomalous Hall conductivity and linear fit to a The data at 2 K and 50 K have been scaled by the factors 
high-field transport. a, In-plane Hall conductivity o,, as a function of 
magnetic induction B at selected temperatures. Dashed lines represent the 


shown for clarity. b, Magnetoresistance (main panel) and Hall effect (inset) 
of Fe3Sn2 with applied magnetic field jioH || c up to 31 T. 
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Extended Data Figure 3 | Momentum and energy-dependent band in d shows the raw data of Fig. 3c (with the same energy and momentum 
structure along high-symmetry directions. a, e, Fermi surface of Fe;Sn2 range), highlighting the spectral weight distribution near the Dirac 
obtained from different experimental geometries. b-d, f, g, Band points. h, Energy distribution curves at different K points indicated in 
dispersion of Fe3Sn2 along high-symmetry directions. The panels c, d, f and g. The curves are shifted along the vertical direction for clarity. 
correspond to the momentum directions along the red (b), orange (c), The inset shows an example of Gaussian fits; the extracted gap size is 
green (d), magenta (f) and purple (g) dotted lines in a and e. The inset A=30+5 meV. 
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Extended Data Figure 4 | Photon-energy dependence of ARPES spectra 
ARPES intensity plot for Fe;Sn, taken along the I'-K direction as a 
function of binding energy k and photon energy. 
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states for the gapless case (a) and the gapped case with Er in (b) and out of 
(c) the gap. d, Fermi energy Er dependence of oy, for the case of a single 
massive Dirac fermion with gap A and Fermi velocity vp. 


Extended Data Figure 5 | Berry curvature and Hall conductivity for a 
massive Dirac fermion. a—c, Schematics of 2D Dirac fermions and the 
corresponding Bloch-sphere representation of the wavefunction of filled 
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Timing of oceans on Mars from shoreline 


deformation 


Robert I. Citron)?, Michael Manga! & Douglas J. Hemingway? 


Widespread evidence points to the existence of an ancient Martian 
ocean!-®, Most compelling are the putative ancient shorelines in 
the northern plains””. However, these shorelines fail to follow 
an equipotential surface, and this has been used to challenge the 
notion that they formed via an early ocean? and hence to question 
the existence of such an ocean. The shorelines’ deviation from a 
constant elevation can be explained by true polar wander occurring 
after the formation of Tharsis"®, a volcanic province that dominates 
the gravity and topography of Mars. However, surface loading from 
the oceans can drive polar wander only if Tharsis formed far from 
the equator”, and most evidence indicates that Tharsis formed near 
the equator!!-!°, meaning that there is no current explanation for 
the shorelines’ deviation from an equipotential that is consistent 
with our geophysical understanding of Mars. Here we show that 
variations in shoreline topography can be explained by deformation 
caused by the emplacement of Tharsis. We find that the shorelines 
must have formed before and during the emplacement of Tharsis, 
instead of afterwards, as previously assumed. Our results imply that 
oceans on Mars formed early, concurrent with the valley networks}, 
and point to a close relationship between the evolution of oceans 
on Mars and the initiation and decline of Tharsis volcanism, with 
broad implications for the geology, hydrological cycle and climate 
of early Mars. 

Distinct geological boundaries (contacts) lining the northern 
plains of Mars for thousands of kilometres have been interpreted 
as palaeo-shorelines and evidence of an early ocean?“ +’. However, 
observed long-wavelength deviations (by up to several kilometres) in 
shoreline elevation from an equipotential have been used as an argu- 
ment against the emplacement of the contacts by a body of liquid water, 
the interpretation of the features as shorelines, and the existence of a 
Martian ocean’. Perron et al.'° showed that the elevation changes of two 
extensive contacts, Arabia (contact 1) and Deuteronilus (contact 2), can 
be explained by deformation due to 30°-60° and 5°-25° of post-Tharsis 
true polar wander (TPW), respectively, because a varying rotation pole 
also changes the orientation of a planet's equatorial bulge, or polar flat- 
tening, altering equipotential surfaces (such as sea levels) globally. Such 
large magnitudes of TPW can be driven by ocean loading/unloading, 
but only if Tharsis formed far from the equator’. If Tharsis formed 
near the equator, then the remnant fossil bulge would have prevented 
ocean loading from causing large amounts of post-Tharsis TPW (see 
Extended Data Fig. 1). 

Most evidence points to the formation of Tharsis near the 
equator!!"!°. Mars’ remnant rotational figure (fossil bulge) is close to 
the equator, indicating a palaeopole of (259.5 + 49.5° E, 71 Be N), 
the likely pre-Tharsis orientation of Mars'*. The pre-Tharsis palaeopole 
also matches the likely orientation of Mars during valley network 
formation’. Formation of Tharsis probably drove only limited 
(approximately 20°) TPW to reach Mars’ current configuration, which 
precludes the possibility that surface loads drove sufficient TPW to 
deform the shorelines!™!®. 


We propose that the Arabia shoreline instead formed before or 
during the early stages of Tharsis emplacement, which initiated > 3.7 
billion years (Gyr) ago’” when the rotation pole of Mars was at the 
palaeopole (259.5° E, 71.1° N) corresponding to the fossil bulge". 
The Arabia shoreline, potentially emplaced at least 4 Gyr ago®, would 
have been modified by both topographic changes from Tharsis (which 
dominates Mars’ topography and gravity on a global scale; see Extended 
Data Fig. 2), and the approximately 20° of Tharsis-induced TPW. The 
Deuteronilus shoreline, which differs less from a present-day equi- 
potential than the older Arabia shoreline, is dated to about 3.6 Gyr 
ago!®, after most of Tharsis was emplaced. However, Tharsis had 
complex and multi-stage growth that extended into the Hesperian and 
Amazonian!”9, meaning that the Deuteronilus shoreline would have 
been deformed by the late stages of Tharsis’ emplacement. We examine 
a chronology in which shoreline deformation is due mainly to Tharsis 
(Table 1), and compare expected deformation due to Tharsis with the 
elevation profiles of the Arabia and Deuteronilus contacts. 

Assuming the Arabia shoreline formed before Tharsis, and the 
Deuteronilus shoreline formed after most of Tharsis was emplaced, 
we compare the best fits for the deformation expected from Tharsis 
to the current topography of the shorelines, including an offset factor 
Z to represent sea level at the time of shoreline formation. We also 
examine the Isidis shoreline, which formed 100 million years (Myr) 
after Deuteronilus!*. For the Arabia shoreline emplaced before Tharsis, 
deformation is expressed as the contribution of Tharsis to Mars’ topo- 
graphy along the shoreline, and the change in topography from limited 
Tharsis-induced TPW. For the Deuteronilus and Isidis shorelines 
emplaced during the late stages of Tharsis growth, deformation is taken 
as the percentage of Tharsis’ contribution to topography occurring after 
the shorelines formed, and no contribution from TPW (because reori- 
entation should occur within tens of thousands of years to a few million 
years after the Tharsis plume reaches the surface”, much less than the 
100 Myr or more that lies between Tharsis initiation and Deuteronilus 
formation). See Methods for more details. 

We show that the Arabia shoreline’s deviations from an equipoten- 
tial can be explained almost entirely by deformation due to Tharsis 
emplacement (Fig. 1). Our best fit (equation (3) with Z = —2.3 km) 
yields a root-mean square misfit Oyms of 0.615 km, comparable to the 
error values from Perron et al.'°, and follows the slope of the shoreline 
data better from 1,000 km to 6,600km. The limited Tharsis-induced 
TPW has a negligible effect. A slightly lower 7;m; is obtained if only 80% 
of Tharsis topography was emplaced after the Arabia shoreline formed 
(Extended Data Fig. 3). However, the difference between the fits using 
80% or 100% of Tharsis’ topography is negligible considering the scatter 
in the shoreline data. Our model therefore suggests that the Arabia 
shoreline formed before or during the early stages of Tharsis’ growth. 

The Deuteronilus shoreline’s deviations from an equipotential can 
be explained by deformation due to the emplacement of about 17% 
of Tharsis topography (Fig. 2), indicating that the shoreline formed 
during the late stages of Tharsis’ growth. Our best fit (equation (4) with 
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Table 1 | Possible evolution of Martian shorelines 


Time (epoch) Event 


Effect 


At least 4 Gyr ago (Early Noachian) 
>3.6 Gyr ago (Late Noachian/Early Hesperian) 


3.6 Gyr ago (Early Hesperian) 
3.5 Gyr ago (Late Hesperian) 
3.6 Gyr ago to present (Early Hesperian—Late Amazonian) 


3.5-3.0 Gyr ago 
(Late Hesperian/Early Amazonian) 


Loading of Utopia basin 


Arabia shoreline forms 
Majority of Tharsis forms 


Both Tharsis emplacement/loading and Tharsis-induced 
TPW (20°) deform the Arabia shoreline 


Deuteronilus shoreline forms 
Isidis shoreline forms 
Remainder of Tharsis forms 


Remaining Tharsis growth deforms the Arabia, 
Deuteronilus and Isidis shorelines 

Loading of the Utopia basin tilts the Isidis basin and 
deforms sections of the Deuteronilus shoreline 


The table outlines the sequence of events required to produce a pre- or early-Tharsis Arabia shoreline, and the formation of the Deuteronilus shoreline concurrent with the later stages of Tharsis’ 
growth. Explaining the Isidis shoreline’s topography requires subsequent loading of the Utopia basin (see Extended Data Fig. 4 and Methods). 


C=0.17 and Z=—3.68km) yields 0; ;=0.110km. Our fit successfully 
recovers the low elevation of the Phlegra contact, and also captures 
the decrease in elevation across Utopia and Elysium West. Neither our 
model nor the Perron et al.'° model captures the full elevation increase 
of the Tantalus segment, which may result from the topographic bulge 
from nearby Alba Patera!®. For the Isidis shoreline, subsequent loading 
of the Utopia basin is also required to explain the shoreline’s topography 
(see Extended Data Fig. 4 and Methods). 

The relation between the shorelines and global deformation due 
to Tharsis and its associated TPW is illustrated in Fig. 3a—c (also see 
Extended Data Fig. 2). We estimate the volume of water that filled the 
northern plains to the Deuteronilus and Arabia shorelines by subtracting 
the relevant Tharsis and TPW contributions from Mars’ topography 
(0.25° per pixel gridded MOLA data”!) and filling the lowlands to 
shoreline elevation (Fig. 3d-f). We estimate a Deuteronilus ocean 
volume of about 1.2 x 10” km?, and an Arabia ocean volume of about 
4.1 x 10’ km?. These are lower limits because we do not remove excess 
terrain, such as Elysium, polar deposits, lava/sediment basin deposits, 
and short-wavelength Tharsis topography (that is, variations in Tharsis 
topography that occur over short length scales). For the Arabia ocean, 
use of a map of Mars with excess terrain removed’ yields an ocean 


volume of about 5.5 x 10”7km?. The ocean volumes we compute are 
slightly lower than previous estimates”” because the Tharsis topography 
we subtract is negative in much of the area enclosed by the northern 
ocean basin. 

Short-wavelength deviations in shoreline elevation from our model 
may be due to our assumptions that both lithospheric thickness and the 
rate of Tharsis emplacement are spatially uniform. Spatial variations 
in lithospheric thickness*’ would allow for non-uniform responses to 
phenomena such as TPW"°, plate flexure and dynamic topography 
from the Tharsis plume”. Spatially variable Tharsis emplacement could 
also affect shoreline modification. Another consideration is ocean 
loading, but the computed effect on shoreline elevations is small (see 
Extended Data Fig. 5 and Methods). 

Several other short- and long-wavelength processes could have 
deformed the shorelines in the 3.5 Gyr or more since their emplace- 
ment, including dynamic topography from mantle plumes‘, 
lithospheric deformation?””®, glacial erosion”® and plate flexure from 
loading/unloading. For example, loading of the Utopia basin may have 
tilted Isidis (see Methods) and deformed sections of the Deuteronilus 
shoreline'®. Other loads that post-date shoreline formation include 
Elysium, the polar deposits, and sections of Tharsis. Such loads 
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Figure 1 | Comparison of Arabia shoreline topography to shoreline 
deformation models. a, Change in topography AT caused by TPW of 20° 
(equation (1)) and Tharsis uplift (equation (2)), illustrating that the latter 
is much more important. b, Current topography of the Arabia shoreline 
from Perron et al.'° (data originally from ref. 7) compared to the Perron 
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et al.'° model of deformation due to post-Tharsis TPW (with T, = 200 km) 
and our model of deformation due to Tharsis emplacement and induced 
TPW (AT Tharsis + ATrpw — 2.3 km). The starting point for the shoreline 
is (24.91° W, 13.48°N). 
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Figure 2 | Comparison of Deuteronilus shoreline topography to 
shoreline deformation models. Current Deuteronilus topography (data 
and contact names from ref. 18) compared to the Perron et al.'° model 


could also induce a small amount (<2°) of post-Tharsis TPW"°. Plate 
flexure associated with impact basins could also deform shorelines. 
While basins of over 1,000 km in diameter pre-date the Deuteronilus 
shoreline, some basins may have been coincident with or post-date the 
Arabia shoreline. Short-wavelength deformation may also be a con- 
sequence of the difficulty in identifying the shorelines themselves'®. 

Increased accuracy in shoreline identification and dating'® can help 
to reconstruct the history of shoreline formation and modification. 
Several potential shorelines®, such as the Ismenius, Acidalia, Elysium, 
Aeolis and Meridiani contacts, have been relatively unexamined 
owing to their high degree of discontinuity’. Shorelines may also be 
mismapped; for example, portions of the Meridiani shoreline may 
be part of the Arabia shoreline”. A re-evaluation of shorelines with 
full consideration of the various deformation processes may enable 
the development of a chronology of oceans on Mars. In particular, the 
Meridiani shoreline®”” may pre-date the Arabia shoreline and have 
contained a larger volume of water””. 

Accurate dating of the Arabia shoreline is necessary to determine 
whether the shoreline formed before or during the early stages of 
Tharsis’ growth. Formation of the Arabia shoreline after some limited 


and our model of deformation due to partial Tharsis emplacement 
(0.17 AT yharsis — 3.68 km). The starting point for the shoreline is 
(96.40° W, 63.69° N). 


early Tharsis growth is suggested by Arabia segments that border 
Acheron Fossae and Tempe Terra®, two of the oldest Tharsis units, which 
are located well north of the expected pre-Tharsis crustal dichotomy 
boundary (the stark difference in elevation and crustal thickness 
between the northern lowlands and southern highlands). However, 
it is possible that the Archeron Fossae and Tempe Terra contacts were 
misidentified as belonging to the Arabia shoreline, or that the Arabia 
shoreline initially followed the pre-Tharsis dichotomy boundary, and 
formed the Tempe Terra and Archaeon Fossae contacts only after early 
Tharsis uplift and deposition. 

The decline in ocean volume from the pre- or early-Tharsis Arabia 
shoreline to the late-Tharsis Deuteronilus shoreline suggests that 
Tharsis volcanism may have played a critical part in the evolution of a 
Martian ocean. After Tharsis was mostly emplaced, by about 3.6 Gyr 
ago, only short-lived lakes may have been stable”’, although a Late 
Hesperian/Early Amazonian ocean has also been suggested on the 
basis of tsunami evidence®*”®. Outgassing from Tharsis could have 
contributed to either heating* or cooling?! the planet, both of which 
could produce the decrease in ocean volume from the Arabia shore- 
line to the Deuteronilus shoreline as Tharsis activity declined. Either a 


Tharsis contribution 


MOLA topography to topography 


Deuteronilus Ocean 


Figure 3 | Shoreline locations relative to current topography, 
deformation due to Tharsis/TPW, and computed ocean extents. 

a, MOLA topography. b, Tharsis contribution to topography (equation 
(2)). ¢, Deformation due to approximately 20° TPW from the palaeopole 
corresponding to the fossil bulge'* with T. =58 km (equation (1)). 

d, Ocean basin filled to the Deuteronilus shoreline, with the topography 
of Mars at the time of the Deuteronilus shoreline’s formation (MOLA 
topography minus 17% of Tharsis topography). e, Ocean basin filled to the 
Arabia shoreline, with the topography of Mars at the time of the Arabia 


Arabia Ocean 
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shoreline’s formation (MOLA topography minus Tharsis topography and 
deformation due to TPW). Short-wavelength remnants of Tharsis are 
visible because Tharsis topography is only modelled up to degree-5. 

f, The ‘Mars without Tharsis’ map’», which is similar to e, but with short- 
wavelength Tharsis, Elysium, and polar topography removed, filled to the 
Arabia shoreline. Shorelines are plotted for Deuteronilus (white), Arabia 
(magenta) and Isidis (cyan), and colour scales denote elevation (or changes 
in elevation) in kilometres. 
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large ocean was in place before Tharsis volcanism initiated, and shrank 
as Tharsis volcanism cooled the planet, or an ocean arose as a result 
of heating caused by Tharsis outgassing and decreased in volume as 
Tharsis volcanism declined. It is also possible that each shoreline rep- 
resents the transient warming of an otherwise frozen ocean or glacial 
state*°, producing a liquid ocean in periods of heightened Tharsis 
activity (which, owing to enhanced surface heat flux, may also have 
resulted in catastrophic circum-Tharsis groundwater discharge’). If 
episodic warming was sufficient to melt most of Mars’ glaciers, the 
decrease in ocean volume may record a declining surface water budget. 
Although geochemical evidence for a northern ocean is ambiguous*, 
an ocean supported by the degassing of sulfur from Tharsis could 
explain the lack of widespread carbonate deposits observed in the 
northern plains**. 

The evolution of water on Mars is critical to understanding the past 
climate and habitability of the planet. Although shorelines on Mars have 
provided compelling evidence for a Martian ocean, to explain their 
deviations from an equipotential has been a challenge. We show that 
the topography of Martian shorelines can be quantitatively explained 
by deformation due to the emplacement of Tharsis and resulting TPW 
(in the case of the Arabia shoreline) or by the latter stages of Tharsis 
emplacement (in the case of the Deuteronilus shoreline). Formation 
of the Arabia shoreline before (or during the early stages of) Tharsis 
emplacement suggests that the Arabia ocean was concurrent with valley 
network incision’’, which probably occurred as part of a globally active 
hydrosphere capable of supporting such an ocean®. The consistency 
between the topography of the Martian shorelines, their ages, and the 
chronology of topographic changes due to Tharsis emplacement and 
associated TPW, suggests that the Arabia and Deuteronilus contacts 
are evidence that Martian oceans existed, and may have been linked 
to Tharsis volcanism. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Arabia shoreline (pre- or early-Tharsis formation). We assume deformation of 
the Arabia shoreline since its formation is due to global changes in topography 
resulting from Tharsis’ formation (emplacement and loading) and the approxi- 
mately 20° of Tharsis-induced TPW (Table 1). 

The topographic response to TPW is given by the change in the flattening of the 
planet caused by the difference between the centrifugal potential at the initial and 
final rotation poles!°. For a shoreline in place before TPW occurs, the deformation 
of the shoreline topography due to TPW"” is: 


2a2 
ATrpw(8, ¢) = “ag Pzotcosn) — Px,9(cos6)][h2 — (1 + ky)] (1) 


where a is the mean planetary radius, w is the rotation rate, gis the surface gravity, 
7 is the angular distance between a given current colatitude and longitude (4, y) 
and the palaeopole and hy and kz are the secular (fluid-limit) degree-2 Love num- 
bers that depend on the density and elastic structure of Mars. The unnormalized 
degree-2 Legendre polynomial P3,9(cos7) = 1(3cos?n — 1). 

The change in topography due to the emplacement of Tharsis and its associated 
loading is: 


ATrharsis(9, Y) = Stharsis(9, Y) — Noharsis(9, ) (2) 


where Syharsis and Nharsis are Tharsis’ contribution to the shape and geoid of Mars, 
respectively. We use gravity and shape coefficients for Tharsis up to degree-5 from 
Matsuyama and Manga". 

The current topography of the Arabia shoreline should therefore follow the 


deformation profile AT,, given by: 


AT4(9, (p) = ATrpw(, p) + ATrharsis(9, Y) + Z (3) 


where Z is a constant to adjust for the sea level at the time of the shoreline’s 
emplacement. We minimize the least-squares misfit (o;s) between equation (3) 
and the shoreline elevation data for the Arabia contact examined in ref. 10 (data 
originally from ref. 7). We assume a fixed palaeopole (259.5° E, 71.1° N), corre- 
sponding to the fossil bulge’. We use an elastic lithosphere thickness T. =58km, 
the expected value at the time of Tharsis’ emplacement", corresponding to hy=2.0 
andky)=1.1". 

We also test whether the Arabia shoreline can be explained by deformation 
due to only a certain percentage of Tharsis’ emplacement and associated loading, 
by multiplying AT pharsis in equation (3) by a factor C, corresponding to the 
percentage of Tharsis topography emplaced after shoreline formation (see 
Extended Data Fig. 3). 

Deuteronilus shoreline (late-stage Tharsis formation). The Deuteronilus shore- 
line post-dates the initiation of Tharsis by >100 Myr, and therefore probably 
formed after a large portion of Tharsis was emplaced. The shoreline also probably 
post-dates most Tharsis-induced TPW, which should have occurred within a few 
million years of load emplacement”. Estimates of load-driven TPW on Mars sug- 
gest timescales less than 10 Myr**"%9, well within the required pre-Deuteronilus 
timescale. Although a fraction of the 20° of Tharsis-induced TPW may be due to 
relaxation of the lithosphere and occur on longer timescales>’, this should have 
a negligible effect given the small influence of TPW on shoreline deformation 
(Fig. 1a). Accordingly, we assume that deformation to the Deuteronilus shore- 
line since its formation is due to the topographic response of Mars to only the late 
stages of Tharsis’ emplacement and associated loading. The current topography of the 
Deuteronilus shoreline should therefore follow the deformation profile ATp, given by: 


ATp(O, yp) = CAT tharsis(9, ) +Z (4) 


where AT Tharsis is deformation due to Tharsis (equation (2)), C is a constant 
representing how much of Tharsis formed after Deuteronilus was formed, and 
Z is a constant to adjust for sea level at the time of the shoreline’s formation. We 
minimize the misfit between equation (4) and the Deuteronilus shoreline elevation 
data’’, to determine the optimal amount of Tharsis topography that should post- 
date the shoreline’s formation. 

Isidis shoreline (late-stage Tharsis formation). Because the Isidis shoreline is 
100 Myr younger than Deuteronilus, we assume a similar deformation profile and 
compare equation (4) to the Isidis shoreline data!*. We use the same value of C opti- 
mized for the Deuteronilus shoreline, but allow Z to vary, reflecting that sea level 
could change in the 100 Myr between the formation of the Deuteronilus contact 
and the Isidis contact, but deformation from Tharsis topography should not change 
substantially. Although C should be slightly less for Isidis, optimizing for C would 
result in an unrealistic C=0 because deformation due to Tharsis along the Isidis 
shoreline results in a tilt that is opposite to the present tilt (Extended Data Fig. 4). 
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For the Isidis shoreline, our model predicts that deformation due to Tharsis 

would have tilted Isidis opposite to its present tilt (Extended Data Fig. 4). While 
this appears contradictory, the mismatch is possible if Isidis was tilted to its present 
orientation by loading of the Utopia basin. The Utopia basin has a large positive 
gravity anomaly**””, indicating about 18 km of excess fill*°. Such a load would have 
caused elastic plate flexure and a peripheral bulge, which could have tilted the Isidis 
basin. Using a plate flexure model, McGowan and McGill"! show that loading of 
Utopia could have tilted Isidis to an even greater extent than currently observed. 
Therefore, some amount of reverse tilting (as our model predicts) is possible. The 
timing of Utopia loading relative to the subsequent Tharsis deformation is irrele- 
vant provided that Utopia loading also occurred after the Isidis shoreline formed. 
We expect loading of Utopia to occur after Isidis shoreline formation because a 
shrinking Martian ocean would evaporate from the Utopia basin last, depositing 
the non-volatile component of the ocean there. Additionally, if the ocean became 
cold and glacial during its decline”, then receding glaciers may also have loaded 
Utopia with excess sediment. The deposits in the base of Utopia basin date to 
the early Amazonian (<3-3.46 Gyr ago)**4, after the emplacement of the Isidis 
shoreline. The eastern portion of Utopia also contains volcanic deposits from 
Elysium that date to the Amazonian“, which could also contribute to loading. 
While loading from the ocean itself is expected to produce some plate flexure, it 
is not sufficient to explain the tilt of the Isidis basin (see Extended Data Fig. 5c), 
and water loading/unloading of Utopia is also insufficient to explain Isidis’ tilt‘. 
Therefore, deposition of material from a receding liquid, muddy or frozen ocean 
may explain the tilt of the Isidis basin, even if some reverse tilting is caused by 
deformation due to Tharsis. 
Effect of elastic lithosphere thickness. The gravity and shape coefficients we use 
to subtract Tharsis topography are based on an assumed T. = 58 km, the expected 
value at the time of Tharsis loading. However, the estimate of T, yields a 90% 
confidence interval with a minimum and maximum of 26km and 92km, 
respectively’. A thinner or thicker T, would alter the deformation due to Tharsis 
(and TPW) because the Love numbers used to compute Mars’ deformation would 
change. To estimate the effect of T. = 26 km or 92 km on deformation due to 
Tharsis, we recompute Tharsis’ gravity and shape coefficients following the method 
of ref. 14. Using a fixed Tharsis centre location (258.6° E, 9.8° N), Matsuyama and 
Manga" compute the degree-2 gravity coefficients of Tharsis using a minimization 
technique with four unconstrained model parameters (T., non-dimensional Tharsis 
load Q, palaeopole colatitude 0g and palaeopole longitude yr), where the palaeo- 
pole corresponds to the axis of rotation when the fossil (remnant) bulge was 
formed. This results in probability density functions for each unconstrained 
parameter, with the weighted averages (expected values) used to compute the 
gravity and shape coefficients. We redo this analysis, as described in section 5 of 
ref. 14, but with T, treated as a constrained parameter. This allows us to estimate 
the expected values of Q, Oz and (yp for a given value of T.. We find that for 
Te = 26km, Q = 3.95, Ix =17.9°, and Gp = 259.1°. For Te=92km, Q = 1.57, 
Op=14.2°, and Py =259.3°. Tharsis’ degree-2 gravity coefficients are recomputed 
using these values. The degree-3 to -5 gravity coefficients of Tharsis are computed 
from minimization against the observed degree-3 to -5 gravity coefficients, and 
are therefore not dependent on T.. Shape coefficients for Tharsis are computed up 
to degree 5 following section 7 of ref. 14. We compute the load Love numbers using 
the ALMA code", with a five-layer model as described in ref. 14. 

We construct new best-fit deformation profiles for T. = 26 km and 92 km, 
but with the corresponding Tharsis gravity and shape coefficients that we com- 
puted for each T.. The best-fit profiles for T, = 26km and 92 km are compared 
with the nominal T.=58 km profiles in Extended Data Fig. 3. All best-fit profiles 
are relatively similar, showing that changes in T. do not have much effect on our 
conclusions. 

Effect of plate flexure. Although Perron et al.'° found that plate flexure due to 
loading of the ocean basin should not substantially affect the shoreline elevations, 
their analysis was for T. = 200 km, whereas we use T. = 58 km. The ocean basin 
resulting from our analysis also has less volume and a different shape, because 
we subtract Tharsis topography, which has a negative component in much of the 
Borealis basin. To compute plate flexure due to ocean loading, we expand the 
surface density of the ocean load in spherical harmonics and compute the asso- 
ciated displacement using the method described in ref. 46. For the Arabia ocean, 
the ocean load is computed by subtracting the pre-Tharsis topography of Mars 
from the best-fit Arabia ocean elevation (Z = —2.3 km). The pre-Tharsis Martian 
topography is computed by subtracting the deformation due to Tharsis and TPW, 
equations (2) and (1), from Mars’ current topography (0.25° per pixel gridded 
MOLA data”'). For the ocean level corresponding to the Deuteronilus and Isidis 
shorelines, only 17% of deformation due to Tharsis was subtracted from Mars’ 
current topography, and the ocean elevation Z was set to —3.68 km and —3.95 km, 
respectively. We use a Young’s modulus of 70 GPa, Poisson ratio of 0.25, and an 
assumed value of T.=58 km. The loaded shoreline topography is compared to the 
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unloaded topography in Extended Data Fig. 5. We compute a maximum magni- 
tude of deflection of 134m, 84m and 57 m, for the Arabia, Deuteronilus and Isidis 
shorelines, respectively. The mean magnitude of deflection is 35 m for the Arabia 
shoreline and 17 m for the Deuteronilus and Isidis shorelines. Deformation of the 
shorelines due to unloading of the ocean basin is negligible. 

Data availability. The data that supports the findings of this study are available on 
request from the corresponding author. Gravity and shape coefficients for Tharsis 
are included in the Supplementary Information. Shoreline data should be requested 
from the respective sources. 


34. Harada, Y. Long-term polar motion on a quasi-fluid planetary body with an 
elastic lithosphere: semi-analytic solutions of the time-dependent equation. 
Icarus 220, 449-465 (2012). 

35. Chan, N.-H. et a/. Time-dependent rotational stability of dynamic planets with 
elastic lithospheres. J. Geophys. Res. Planets 119, 169-188 (2014). 

36. Hu, H., van der Wal, W. & Vermeersen, L. L. A. A full-Maxwell approach for large 
angle polar wander of viscoelastic bodies. J. Geophys. Res. Planets 122, 
2745-2764 (2017). 

37. Moore, K. M., Chan, N.-H., Daradich, A. & Mitrovica, J. X. Time-dependent 
rotational stability of dynamic planets with viscoelastic lithospheres. /carus 
289, 34-41 (2017). 


38. 
39. 


41. 


45. 


46. 


Sjogren, W. L. Mars gravity: high-resolution results from Viking Orbiter 2. 
Science 203, 1006-1010 (1979). 

Zuber, M. T. et al. Internal structure and early thermal evolution of Mars from 
Mars Global Surveyor topography and gravity. Science 287, 1788-1793 
(2000). 


. Searls, M.L., Banerdt, W. B. & Phillips, R. J. Utopia and Hellas basins, 


Mars: twins separated at birth. J. Geophys. Res. Planets 111, E08005 

(2006). 

McGowan, E. M. & McGill, G. E. Anomalous tilt of Isidis Planitia, Mars. Geophys. 
Res. Lett. 33, LO8S06 (2006). 


. Fairén, A. G. A cold and wet Mars. Icarus 208, 165-175 (2010). 
43. 


Werner, S. C. & Tanaka, K. L. Redefinition of the crater-density and absolute-age 
boundaries for the chronostratigraphic system of Mars. Icarus 215, 603-607 
(2011). 


. Tanaka, K. L., Robbins, S. J., Fortezzo, C. M., Skinner, J. A. & Hare, T. M. The 


digital global geologic map of Mars: chronostratigraphic ages, topographic and 
crater morphologic characteristics, and updated resurfacing history. Planet. 
Space Sci. 95, 11-24 (2014). 

Spada, G. ALMA, a Fortran program for computing the viscoelastic Love 
numbers of a spherically symmetric planet. Comput. Geosci. 34, 667-687 
(2008). 
Arkani-Hamed, J. The lunar mascons revisited. J. Geophys. Res. Planets 103, 
3709-3739 (1998). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


a b 
CY Tharsis 
; 7 fossil 
bulge —~ 
paleo- 
equatorial equatorial 
bulge bulge 
d e 


---2)- 


Extended Data Figure 1 | Illustration of the feasibility of post-Tharsis 
TPW depending on the location of Tharsis’ formation. a—c, Tharsis 
(orange zone) forms far from the palaeo-equator (a), causing large-scale 
TPW as the planet reorients so Tharsis is at the equator (b). After the 
reorientation, the fossil bulge is far from the current equatorial bulge, 
making the rotation pole sufficiently unstable to allow for ocean loading 
(blue zone) to cause subsequent TPW along an arc 90° from Tharsis 
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(c, blue dashed line)!. d-f, Alternatively, Tharsis forms near the palaeo- 
equator (d), causing limited (approximately 20°) TPW (e). The position of 
the fossil bulge near the equator stabilizes the planet against subsequent 
TPW caused by oceans and other surface loads (f, see supplementary 
figure 1 of ref. 10). Formation of Tharsis near the equator is supported by 
refs 14 and 15. 
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Extended Data Figure 2 | Map of shoreline locations, MOLA and Isidis (cyan) shoreline data and regional names are from ref. 18. The 
topography, and Tharsis deformation. Arabia (magenta) shoreline contribution of Tharsis to Mars’ topography up to degree-5 (equation (2)) 
data are from ref. 10 (data originally from ref. 7). Deuteronilus (white) is displayed as 1-km dark grey contours (dashed contours are negative). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


Arabia shoreline data 


Te=26 km, C=1, Z=-1.66 km, orms=0.635 km 

Te=58 km, C=1, Z=-2.33 km, orms=0.615 km 

Te=92 km, C=1, Z=-2.41 km, orms=0.624 km 
— = 7.=26 km, C=0.79, Z=-1.83 km, orms=0.580 km 
— — 1[¢=58km, C=0.8, Z=-2.43 km, oims=0.589 km * 
— = Te=92 km, C=0.78, Z=-2.52 km, orms=0.596 km a ee 
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Deuteronilus shoreline data 
— 7[e=26 km, C=0.12, Z=-3.69 km, orms=0.105 km 
— 7e=58km, C=0.17, Z=-3.68 km, oms=0.110 km 
— 7-=92 km, C=0.17, Z=-3.68 km, oms=0.111 km 
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Extended Data Figure 3 | Effect of elastic lithosphere thickness on show the best fit when the percentage of Tharsis topography added 
deformation due to Tharsis. a, Current Arabia shoreline topography after shoreline formation was allowed to vary by a factor C. Solid lines 
compared to displacement due to TPW and Tharsis deformation assume 100% percent of Tharsis topography was emplaced after shoreline 
(equation (3)). Tharsis gravity and shape coefficients are computed formation (C= 1). b, Deuteronilus shoreline topography compared to the 
separately for T. = 26 km, 58 km and 92 km (see Methods), which each best-fit displacement due to Tharsis loading (equation (4)) for T. = 26 km, 
yield a corresponding best-fit offset Z and error 0;+ms. Dashed lines 58km and 92km. 
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Current shoreline topography 
— = Perron et al. (2007) model 
—— Present paper 
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Extended Data Figure 4 | Comparison of Isidis shoreline topography shoreline can be explained by subsequent loading of the Utopia basin 
to shoreline deformation models. Current Isidis shoreline topography (see Methods). The starting point for the shoreline is (82.32° E, 7.36° N), 
(elevation data from ref. 18) compared to the Perron et al.!° model near the southwest (SW) rim, with shoreline data proceeding clockwise 
for T. = 200km and our model of deformation due to partial Tharsis through the northeast (NE) rim. 


emplacement (0.17 AT Tharsis — 3.95 km). The topography of the Isidis 
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Extended Data Figure 5 | Effect of plate flexure due to ocean loading on shoreline topography. Current shoreline elevations are plotted against 
displaced elevations for the Arabia shoreline (a), the Deuteronilus shoreline (b) and the Isidis shoreline (c). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


doi:10.1038/nature26136 


Insulin resistance in cavefish as an adaptation to a 
nutrient -limited environment 


Misty R. Riddle, Ariel C. Aspiras!*, Karin Gaudenz’, Robert Peufs’, Jenny Y. Sung?, Brian Martineau!, Megan Peavey, 
Andrew C. Box’, Julius A. Tabin!, Suzanne McGaugh’*, Richard Borowsky’, Clifford J. Tabin! & Nicolas Rohner*° 


Periodic food shortages are a major challenge faced by organisms 
in natural habitats. Cave-dwelling animals must withstand 
long periods of nutrient deprivation, as—in the absence of 
photosynthesis—caves depend on external energy sources such as 
seasonal floods!. Here we show that cave-adapted populations of 
the Mexican tetra, Astyanax mexicanus, have dysregulated blood 
glucose homeostasis and are insulin-resistant compared to river- 
adapted populations. We found that multiple cave populations 
carry a mutation in the insulin receptor that leads to decreased 
insulin binding in vitro and contributes to hyperglycaemia. Hybrid 
fish from surface-cave crosses carrying this mutation weigh more 
than non-carriers, and zebrafish genetically engineered to carry the 
mutation have increased body weight and insulin resistance. Higher 
body weight may be advantageous in caves as a strategy to cope with 
an infrequent food supply. In humans, the identical mutation in 
the insulin receptor leads to a severe form of insulin resistance and 
reduced lifespan. However, cavefish have a similar lifespan to surface 
fish and do not accumulate the advanced glycation end-products 
in the blood that are typically associated with the progression 
of diabetes-associated pathologies. Our findings suggest that 
diminished insulin signalling is beneficial in a nutrient-limited 
environment and that cavefish may have acquired compensatory 
mechanisms that enable them to circumvent the typical negative 
effects associated with failure to regulate blood glucose levels. 

The fish species A. mexicanus consists of interfertile river-dwelling 
and cave-dwelling populations (Fig. 1a), here referred to as ‘surface 
fish’ and ‘cavefish, respectively, that experience markedly different 
nutrient availablility”. Cavefish are resistant to starvation; when food- 
deprived, cavefish lose a smaller fraction of their body weight compared 
to surface fish*. Several factors have been identified that contribute to 
starvation resistance, including reduced metabolic circadian rhythm’, 
decreased metabolic rate* and elevated body fat*. The genetic changes 
underlying these adaptations remain largely unknown. 

In this study, we focused on three cavefish populations, which are named 
for the caves they inhabit (Tinaja, Pachon and Molino), that have evolved 
independently from two different stocks of surface fish that invaded caves 
millions of years ago®. Tinaja and Pachén cavefish originated from a more 
ancient surface population compared to Molino cavefish’. 

A critical aspect of metabolic homeostasis is blood glucose 
regulation’. We compared blood glucose levels of Tinaja, Pachén and 
Molino cavefish with those of laboratory-raised surface fish, and found 
that the cave populations had significantly higher postprandial blood 
glucose levels (means of 64, 76 and 92 versus 47 mg di}, respectively; 
Fig. 1b). We investigated the dynamics of glucose homeostasis during 
short- and long-term fasting (Fig. 1c). Cavefish had significantly higher 
blood glucose levels after 24hours of fasting (mean of 80, 80 and 
107 mg dl“! for Tinaja, Pachon and Molino populations, respectively, 
versus 59 mg dl“! for surface fish, P< 0.05, one-way ANOVA). After 


21 days, we observed a marked decrease in blood glucose levels in 
Tinaja and Pachon cavefish; at 1 and 21 days, respectively, Tinaja 
cavefish showed mean blood glucose levels of 80 versus 48 mg dl! 
(P < 0.0001) and Pachon cavefish showed mean levels of 80 versus 
44mg dl“! (P< 0.0001). Surface fish experienced a minor decrease 
from 59 to 40 mg dl“! (at 1 and 21 days, respectively, P=0.02), and there 
was no significant change in Molino cavefish (107 versus 100 mg dI~! 
at 1 and 21 days, respectively, P= 0.89). Molino cavefish maintain 
elevated blood glucose levels, highlighting what appears to be a funda- 
mental difference between their metabolic adaptation and the adapta- 
tions of the other cavefish populations that we investigated. However, 
our results suggest that dysregulated glucose homeostasis is a common 
feature of cavefish populations. 

To further test this hypothesis, we compared acute control of glucose 
homeostasis using a glucose tolerance test (Fig. 1d). We injected glucose 
into the intraperitoneal cavity of surface fish and cavefish, transiently 
raising blood glucose levels to over 400mg dl~’ in most fish (Fig. 1d). 
Eight hours after injection, blood glucose levels of the surface fish were 
the same as those of PBS-injected controls (mean 126 versus 120mg dl", 
respectively; Fig. 1d) whereas blood glucose levels remained highly 
elevated in all cavefish populations (mean 374, 432 and 411 mg dl“! 
in Tinaja, Pachon and Molino populations, respectively, P< 0.0005, 
one-way ANOVA compared to PBS-injected controls; Fig. 1d). 
In Tinaja and Pachon cavefish, blood glucose levels remained elevated 
at 13.5h after injection, but were not significantly different from those 
of PBS-injected controls (Tinaja cavefish, mean 157 versus 79 mg dl“! 
in glucose-injected and PBS-injected fish, respectively, P= 0.47; in 
Pachon cavefish, mean 159 versus 105 mg dl“ in glucose-injected and 
PBS-injected fish, respectively, P= 0.65). In Molino cavefish, blood 
glucose remained significantly elevated at 13.5h (315 mg dl“! versus 
144mg dl" in PBS-injected controls, P= 0.000001) and returned to the 
same levels as PBS-injected controls after 24h (132 versus 133 mg dl“! 
in glucose-injected versus PBS-injected controls, respectively). Our 
results suggest that cavefish have impaired glucose clearance. 

Glucose homeostasis requires the balanced release of insulin and 
glucagon that instructs tissues to absorb glucose from the blood or 
produce glucose from stored glycogen?. We found that at ten days post 
fertilization the first pancreatic islet is visible and does not differ in 
the number of cells producing glucagon (54 versus 50 in surface fish 
and cavefish, respectively, P=0.678) or insulin (54 versus 52 in surface 
fish and cavefish, respectively, P=0.275), which suggests that the pan- 
creas develops similarly in surface fish and cavefish (n=5 per popula- 
tion; Extended Data Fig. 1). In 1-2-year-old adults, we did not detect 
a difference in circulating glucagon levels after a 24h fast (Extended 
Data Fig. 2). Circulating insulin levels tended to be higher in Tinaja 
cavefish, but the results were not significant (n = 24 per population; 
Extended Data Fig. 3). Nonetheless, we found evidence of diminished 
insulin response in cavefish. We injected the fish with arginine, which 
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Figure 1 | Altered glucose homeostasis in cave-adapted A. mexicanus 
populations. a, Surface fish and Tinaja cavefish of A. mexicanus. b, Blood 
glucose (1h postprandial) in surface fish compared to cavefish (n = 10, 
13, 3 and 3, respectively, for surface fish, Tinaja, Pachon and Molino 
cavefish). c, Fasting blood glucose at day 1 versus day 21 (n= 20 per 
population and condition). d, Glucose tolerance test. Blood glucose 

after intraperitoneal injection of glucose (red) or PBS (blue). Data 

points represent values for individual fish and grey shade indicates 95% 
confidence interval for polynomial regression. e, Blood glucose 5h 

after intraperitoneal injection of arginine (m= 10 per population and 


stimulates the simultaneous release of glucagon and insulin'®, and 
observed that while surface fish experienced a significant decrease in 
blood glucose level (mean 80 versus 38 mg dl~', PBS-injected versus 
arginine-injected fish, P= 0.006), cavefish blood glucose levels did not 
change (Fig. le). In addition, we found that injection of recombinant 
human insulin caused a significant drop in blood glucose after 60 min 
in surface fish, but not in Tinaja cavefish (Extended Data Fig. 4). Our 
combined observations that glucagon and insulin levels do not differ 
between surface and cavefish, and that cavefish do not decrease blood 
glucose levels in response to arginine or insulin, suggest that cavefish 
may be insulin resistant. 

Insulin-stimulated glucose uptake proceeds through phosphorylation 
of AKT at serine 473 (pAKT)"". We compared the ratio of pAKT to AKT 
in freshly dissected skeletal muscle treated with recombinant insulin 
(Fig. 1f, g). Consistent with the apparent dysregulation of glucose home- 
ostasis, we observed lower pAKT levels in Tinaja cavefish than in surface 
fish (mean 0.775 versus 1.39 pAKT:AKT ratio, respectively, P=0.017; 
Fig. 1f, g), which suggests that Tinaja cavefish are indeed insulin resistant 
relative to surface fish. Pachon cavefish muscle also had lower pAKT 
levels in response to insulin (mean 0.806 pAKT:AKT ratio, P=0.027; 
Fig. 1f, g); however, despite having elevated blood glucose (Fig. 1b), 
Molino cavefish pAKT levels were equivalent to surface fish (mean 1.26 
pAKT:AKT ratio, P=0.99; Fig. 1f, g). Our results suggest that Tinaja and 
Pachon fish have evolved altered blood glucose regulation and insulin 
resistance in parallel, whereas Molino fish may have evolved their altered 
glucose metabolism through a different mechanism. 

To investigate the genetic mechanism underlying insulin resistance 
in cavefish, we examined the sequences of all known genes in the 
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condition). f, Western blot: cell lysates probed with pAKT (ser473) and 
AKT antibodies. Lysates produced from skeletal muscle treated ex vivo 
with PBS, a high (H, 9.5-11.5 1g ml!) or a low (L, 0.95-1.15 1g ml!) 
level of insulin. g, Quantification of bands by densitometry of highest 
concentration treatment (n = 3 per population). For box plots, median, 
25th, 50th and 75th percentiles are represented by horizontal bars, and 
vertical bars represent 1.5 x interquartile ranges. Significance calculated 
using one-way ANOVA with Tukey’s honest significant difference (HSD) 
post hoc test. NS, P > 0.05; *P < 0.05; **P < 0.005; ***P < 0.0005. For gel 
source data, see Supplementary Fig. 1. 


insulin pathway using the available genome sequence’? (Supplementary 
Information 1). Notably, we found a coding difference in the insulin 
receptor gene (insra) between surface fish and cavefish that affects a 
conserved proline in the extracellular cysteine-rich domain (P211L; 
Fig. 2a—c). The presence of the mutation correlates with insulin resist- 
ance, as Tinaja and Pachon populations carry the mutation whereas 
Molino cavefish have the wild-type allele (Fig. 2b, c). Notably, the 
same genetic alteration is implicated in at least two known cases of 
Rabson-Mendenhall syndrome'*"“, a form of severe insulin resistance 
in humans (Fig. 2c). The biochemical effect of the mutation has not 
previously been explored, but the position in the cysteine-rich domain 
suggests a role in insulin binding’. To test this, we generated transgenic 
HEK293T (Flp-In-293) cell lines that stably express the full-length sur- 
face fish or Tinaja cavefish insra and incubated the cells with different 
concentrations of fluorescein isothiocyanate (FITC)-labelled human 
insulin. We measured fluorescence as a readout for binding efficiency 
using an image-based cytometry approach (Imagestream X Mark II) 
and found that cells that expressed the cavefish receptor displayed sig- 
nificantly lower binding at all but the lowest concentrations of insulin 
(Fig. 2d). Our results suggest that the P211L mutation of insra affects 
insulin signalling by altering insulin binding efficiency. 

We next tested for the presence and frequency of the P211L mutation in 
wild-caught fish. We genotyped 71 surface fish from different localities, 
and 51 cavefish from 6 different caves (Supplementary Information 2; 
Fig. 3a). Consistent with our observations above, the mutation was 
absent in Molino fish (n = 8) but present in all other tested cave popu- 
lations (Tinaja, Yerbaniz, Pach6n, Japonés and Arroyo, combined 
n= 36). Notably, the cave populations carrying this mutation are all 
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Figure 2 | Coding mutation in the cavefish insulin receptor leads to 
decreased insulin binding. a, Schematic of the insulin receptor (adapted 
from ref. 24). Red asterisk depicts position of the P211L mutation. LR, 
leucine-rich repeats; CR, cysteine-rich domain; Fn, fibronectin type II 
domain; TM, transmembrane domain; TK, tyrosine-kinase domain. 

b, Sequence chromatogram of the mutation in Astyanax. c, Amino acid 
alignment of the insulin receptor P211L mutation with patients with 
Rabson-Mendelhall syndrome (‘Human (RMS)’). d, Relative FITC 
intensity of cells stably transfected with Flag-tagged surface-fish or 


derived from the same ancestral stock of surface fish!®. Although the 
mutation was present in heterozygote conditions in some of the caves, 
we did not find any cavefish homozygous for the surface allele (Fig. 3a). 
Our findings suggest that there is active selection for the mutation in 
the caves, and a partially dominant effect of the cave allele. We did not 
observe the cave allele in any of the surface fish, which suggests that the 
mutation either appeared de novo in the cave populations, represents a 
rare variant not detected by our sampling frequency or is absent in the 
current surface population but was present in the ancestral surface fish 
stocks!® (Supplementary Information 3). 

To investigate whether the P211L mutation contributes to altered glu- 
cose regulation, we measured fasting blood glucose levels in 192 F, fish 
from a surface fish-Tinaja cavefish cross fed ad libitum their entire lives 
(Extended Data Fig. 5). We found that elevated blood glucose—defined 
as levels that exceed the surface fish mean (59 mg di~') or maximum 
(75 mg dl~')—is a non-Mendelian trait in our cross: only 4.7%, rather 
than the expected 25%, of fish inherited elevated blood glucose, which 
indicates that the trait is not monogenic. Although F) fish with the 
P211L mutation did not differ significantly in blood glucose levels from 
those without the mutation, we found that only F, fish carrying the 
P211L mutation have elevated blood glucose levels exceeding the surface 
fish mean (n= 9) or maximum (n= 3), which suggests a necessary but 
not sufficient role for the mutation in altered blood glucose regulation. 

We found that cavefish weigh more than surface fish on a 
nutrient-limited diet (mean 2.08 versus 1.52 g, respectively, P=0.02; 
Fig. 3b). To investigate whether the insra mutation influences weight, 
we genotyped and weighed 124 surface-Tinaja male F; fish at approxi- 
mately 1.5 years of age that were fed ad libitum. We focused on males, as 
egg mass varies between individual females and can account for as much 
as 41% of female body weight (Extended Data Fig. 6), representing 
a confounding variable if included in our analysis. We found that males 
carrying one or two copies of the cave P211L insra allele weigh on 
average 27% more than hybrids that carry only the surface allele (mean 
1.63 versus 1.28 g, respectively, P= 0.006; Fig. 3c). Cavefish have an 
increased appetite that is associated with a mutation in the melano- 
cortin 4 receptor®. Although this mutation segregates independently 
from the insra mutation in the F, hybrid population, it is possible that 
another locus in cis to the P211L mutation influences appetite. To elimi- 
nate the effect of appetite on our analysis of the insra allele, we housed 
F, fish of approximately the same starting weight (<2 g) individually 
and ensured they ate a diet of 6 mg of food per day for 4 months. We 
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Tinaja-cavefish insulin receptor and incubated with FITC-labelled insulin. 
Each point represents mean FITC intensity of >2,500 live cells normalized 
to the mean intensity of untreated cells. Lines represent results from 

local polynomial regression fitting. Triangles (surface fish, filled; Tinaja 
cavefish, unfilled) represent data from competitive binding assay in which 
cells were incubated with 10|.M unlabelled insulin. Significance calculated 
using one-way ANOVA (between surface fish and Tinaja cavefish) with 
Tukey’s HSD post hoc test, *P < 0.05; **P < 0.005; ***P < 0.0005. 


found that homozygous P211L fish gained significantly more weight 
(mean 0.37 g, n = 20) than fish that did not carry the cave allele (0.19 g, 
n=60, P=0.02) consistent with the idea that variation at the insra locus 
influences weight gain independent of appetite regulation. 

To determine whether the insulin resistance and weight gain associ- 
ated with the P211L mutation in cavefish are indeed due to alteration of 
the insra gene, we used CRISPR gene editing to introduce the mutation 
into zebrafish (Danio rerio) via homology directed repair!” (Extended 
Data Fig. 7). We found that zebrafish homozygous for the P211L muta- 
tion have a lower ratio of pAKT to AKT in their skeletal muscle com- 
pared to heterozygous siblings, in both untreated (0.05 versus 0.13 in 
P211L-homozygous and heterozygous fish, respectively, P=0.016) and 
insulin-stimulated conditions (0.13 versus 0.32 in P211L-homozygous 
and heterozygous fish, respectively, P=0.067, n =3 per genotype and 
condition) (Fig. 3f). In addition, we found that zebrafish homozygous 
for the cave allele are longer (mean 20.0mm, versus 18.4mm in heterozy- 
gous fish, P=0.0046) and weigh more (mean 124.6 mg, versus 99.7 mg 
in heterozygous fish, P= 0.022) than their siblings when raised under 
the same conditions (Fig. 3e, g, h). Our findings show that the P211L 
mutation contributes to both the increased weight and insulin resistance 
observed in Tinaja and Pachon cavefish. Increased body weight in fish 
homozygous for the insulin-receptor mutation is unexpected, consider- 
ing that in mammals, full loss-of-function mutations in the insulin recep- 
tor are associated with retarded growth and lower levels of body fat'®. Our 
results suggest that diminished insulin signalling has an opposite effect 
in fish, but the mechanisms leading to the difference remain unclear. 

Cavefish are insulin resistant and hyperglycaemic; in humans these 
phenotypes precede and define type 2 diabetes, respectively. In addi- 
tion, cavefish have a fatty liver?, which is also frequently associated with 
type 2 diabetes. Transgenic zebrafish carrying the insra mutation show 
a reduction in scale size (Extended Data Fig. 8), as previously described 
in other hyperglycaemic zebrafish models’? and a similar phenotype has 
recently been described in cavefish””. In principle, these data could sug- 
gest an evolutionary tradeoff in which physiological health is sacrificed 
to reap the benefits of starvation resistance. However, whereas surface 
fish begin to exhibit signs of ageing such as sunken skin, tattered fins 
and a hunched back”! (Fig. 4a) by age 15, Tinaja and Pachén cavefish 
can live in excess of 14 years without these indications of senescence 
(Fig. 4b, c) and with no difference in fertility decline relative to surface 
fish. Cavefish may have evolved compensatory mechanisms enabling 
them to remain healthy despite potentially deleterious metabolic changes. 
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Figure 3 | The P211L mutation of insra is overrepresented in cave 
environments and is associated with higher body weight in surface- 
cave hybrids. a, Map of the region, overlain with genotyping results of 
wild-caught samples. Pie charts indicate percentage of fish homozygous 
for surface allele (blue), cave allele (orange) or heterozygous (grey). Size of 
pie chart roughly indicates the number of fish genotyped (Molino, n= 8; 
Surface, n =71; Pachon, n= 9; Yerbaniz, n = 8; Japonés, n =5; Arroyo, 
n=7; Tinaja, n= 14; position of pie charts corresponds to location and 
vertical order of population name on the map). The P211L allele is absent 
in all wild-caught surface fish and Molino cavefish (descended from a 
more-recent surface-fish lineage). The P211L mutation is present in all 
sampled cavefish populations descended from the more-ancient surface- 
fish lineage. Tinaja, Yerbaniz, Japonés and Arroyo are geographically close 
and believed to represent a single invasion event; Pachén represents an 
independent invasion”’. Map source: Imagery ©2017 Landsat/Copernicus, 
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Figure 4 | Despite elevated blood glucose levels and insulin resistance, 
Tinaja and Pachon cavefish do not show signs of senescence and do not 
accumulate advanced glycation end-products in the blood. a-c, Surface 
(a), Pachon (b) and Tinaja (c) fish kept in the laboratory for the indicated 
duration and fed ad libitum. Cavefish were wild-caught; ages represent 
minimum age. Surface fish (a) shows signs of ageing, such as loose skin and 
bent tails (yellow arrows), that are absent in cavefish at comparable ages (b, c). 
d, Quantification of advanced glycation end-products in serum (AGE-BSA) 
from approximately two-year-old fish after a three-day fast (n = 4 for each 
population). *P < 0.05, one-way ANOVA with Tukey’s HSD post hoc test. 
For box plots, median, 25th, 50th and 75th percentiles are represented by 
horizontal bars and vertical bars represent 1.5 x interquartile ranges. 
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Map data ©2017 Google, INEGI. b, Weight of Tinaja males (n= 6) and 
surface males (n =5) on a nutrient-limited diet. c, Weight of 18-month- 
old F, male Tinaja-surface hybrids genotyped for the P211L mutation. 
P-homozygous (P) surface fish, n = 22; L-homozygous (L) cavefish, n = 27; 
heterozygotes (P/L), n= 53. d, Change in weight of F; Tinaja-surface 
hybrid males on fixed diet. n = 21 (P), 39 (P/L) and 20 (L). e, Images of 
wild-type (WT) and homozygous P211L mutant (HOM) zebrafish siblings. 
f, Ratio of pAKT:AKT in adult zebrafish skeletal muscle treated ex vivo 
with PBS or insulin (n = 3 per genotype and condition). g, h, Length and 
weight of wild-type zebrafish (n = 13 (P)) and heterozygous (n = 22 (P/L)) 
and homozygous (n= 11 (L)) P211L mutant zebrafish. In box plots the 
median, 25th, 50th, and 75th percentiles are represented by horizontal 
bars and vertical bars represent 1.5 x interquartile ranges. Significance 
calculated using two-tailed students t-test, *P < 0.05. 


A major cause of morbidity in diabetic patients is tissue damage 
caused by excessive non-enzymatic glycation of proteins in the blood, 
generating advanced glycation end-products (AGEs)”*. AGEs are 
closely associated with diabetes-induced vascular damage, cardio- 
vascular disease and ageing”’. We compared the level of AGEs in the 
serum of two-year-old fish that had been fed ad libitum their entire lives 
(Fig. 4b). We did not detect a difference in the levels of AGEs between 
Tinaja and Pachon cavefish relative to surface fish (mean of 9.7, 9.3 and 
8.9\1g ml“, respectively, P=0.99, 0.95), despite the elevated blood glucose 
levels of the cavefish (Fig. 1b). These two populations may have mecha- 
nisms for reducing protein glycation, rendering them impervious to the 
damaging effects of elevated blood glucose. Notably, the Molino cavefish 
do have elevated levels of AGEs (mean 14.1 jig ml“, versus 8.9}1g ml! in 
surface fish, P=0.03). It remains to be determined whether the health and 
longevity of the Molino population is influenced by accumulation of AGEs, 
but our results suggest they may have evolved altered blood glucose homeo- 
stasis through a different mechanism than did Tinaja and Pachon cavefish. 

Our findings establish cavefish as a model with which to investi- 
gate resistance to pathologies of diabetes-like dysregulation of glucose 
homeostasis. Moreover, our results highlight the extreme physiological 
measures that can evolve in critical metabolic pathways to accommo- 
date exceptional environmental challenges. 

Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. Investigators were blinded to fish genotypes during experi- 
ments when possible. 
Fish husbandry and diet. Fish husbandry was performed as previously 
described”°. Unless stated otherwise, fish were fed ad libitum with a combination of 
New Life Spectrum TheraA + small fish formula and Artemia and housed at densi- 
ties of less than or equal to two adult fish per litre of water. F, hybrids were housed 
individually in 1.51] tanks and fed three pellets (~6 mg) of New Life Spectrum 
TheraA + small fish formula once per day for over 4 months. For the starvation 
experiment, fish were moved to individual containers and water was changed daily. 
Blood glucose, glucose tolerance and arginine tolerance. Blood was collected 
from the caudal tail vein using a U-100 insulin needle and glucose was measured 
using FreeStyle Lite blood glucose meter and test strips. 

Glucose (2.5 mg/gram fish), arginine (6.6 |1M/gram fish), or PBS was injected 
into the intraperitoneal cavity using a U-100 insulin needle. 
pAKT quantification. We quantified pAKT level in fillets of skeletal muscle taken 
directly after fish decapitation. For A. mexicanus, skeletal muscle was cut into 
three equal strips per fish. Strips were incubated in PBS, 0.1 or 1x concentration 
of recombinant human insulin for 25 min (Sigma product 19278; 1 x concentra- 
tion = 9.5-11.51g/ml insulin). The tissues were rinsed in PBS and then homoge- 
nized and lysed in RIPA buffer (Sigma) with protease and phosphatase inhibitor 
(Pierce Protease and Phosphatase Inhibitor Mini Tablets, EDTA Free) for 30 min. 
Protein concentration was measured via BCA (Pierce). Lysate protein concentra- 
tions were then equalized and run on 4-12% Bis-Tris protein gel and transferred on 
a nitrocellulose membrane. Blots were probed for AKT (Cell Signaling). Following 
stripping, blots were probed for pAKT (ser473) (Cell Signaling). ImageJ was used 
for densitometry measurements. For D. rerio two fillets of skeletal muscle were 
removed from both sides of fish directly after decapitation. Fillets were rinsed in 
PBS and then incubated in PBS or 10j1g/ml human recombinant insulin (Sigma, 
10908) for approximately 40 min. The skin was then removed from the skeletal 
muscle and the muscle was finely minced using a scalpel. We quantified the ratio 
of pAKT:AKT using the Akt(pS473) + total Akt ELISA Kit (Abcam ab126433) 
according to the manufacturers’ protocol. We used 200 il lysis buffer per sample 
and loaded 85 1 of lysate per well. 
insra P211L genotyping in A. mexicanus. Genomic DNA from tail fin clips was 
diluted fivefold and used as target DNA to amplify the insra locus using the following 
oligonucleotide primers: insra_f: GCACCCTTACACCCTTACATGA; insra_r: 
TACCGCTCAGCACTAATTTGGA,; product size: 700 bp. PCR reactions were 
carried out in 12.5-j1l volume containing 1 x LA PCR Buffer II (Clontech), 2.5 mM 
MgCl, 0.4mM dNTP mix, 0.4|1M of each forward and reverse primer and 0.05 
units of TaKaRa LA Taq DNA Polymerase (Clontech). The PCR cycling conditions 
were as follows: initial denaturation at 94°C for 2 min, followed by 35 cycles of 94°C 
for 30s, annealing temperature 52°C for 30s and 72°C for 1 min. A final 5-min 
elongation step was performed at 72°C. The PCR products were diluted tenfold 
and sequenced directly on a 3730XL DNA Analyzer (Applied Biosystems) using 
the sequencing primer: GGTGGAGTTGATGGTGGTATAG. 
Selection scans at the insra locus. We examined the insra locus with data that are 
a part of an ongoing genome-wide selection and demography companion study 
(A. Herman et al., manuscript in preparation). Methods are explained in greater 
detail in this demographic companion study; in brief, we used Illumina Hiseq 2000 
to sequence 100-bp reads from 6-10 individuals from each population of Tinaja 
cave, Molino cave, Pachon cave, Rascon surface and Rio Choy surface populations, 
(total n = 43) and two individuals from the sister taxa Astyanax aeneus. Individuals 
were sequenced with v3 chemistry. Reads were cleaned with Trimmomatic v.0.307” 
and cut-adapt v.1.2.178 and aligned to the reference Pachén cavefish genome using 
bwa-mem algorithm in bwa-0.7.1”? resulting in an aligned coverage depth of 
~7-12x. Variants were called using the Genome Analysis Toolkit v.3.3.0 (GATK)*? 
and Picard v.1.83 (http://broadinstitute.github.io/picard/). Outlier scan metrics 
(a, DXY, FST and Tajima’s D) were conducted using VCFtools v.0.1.13*! and custom 
scripts. HSCAN (https://messerlab.org/resources/) and hapFLK* were also used 
to examine insra for outliers. Metrics were dense-ranked across the genome and 
the ranking position of insra was used to determine whether it was exceptionally 
divergent between cave and surface populations relative to the rest of the genome. 
Glucagon and insulin quantification. The number of cells producing insulin or 
glucagon was determined using the following protocol: 10-11 days post fertiliza- 
tion fish were euthanized with an overdose of tricaine and fixed in 4% paraform- 
aldehyde overnight at 4°C. Fish were washed in PBST, transferred to water for 
1 min, acetone for 10 min at —20°C, water for 1 min, and then washed in PBST. 
Blocking in 5% donkey serum, 1% DMSO and 0.2% BSA was done for 1h at room 
temperature. Fish were incubated with primary antibodies (1:200 sheep anti- 
glucagon (Abcam), 1:200 guinea pig anti-insulin (DAKO)) and then secondary 
antibodies (1:400 donkey anti-sheep 488, 1:400 goat anti-guinea pig 647) overnight 


at room temperature in glass vials, then washed with PBST, stained with DAPI 
and imaged. Images were collected at 63 x using a 1.0|1m z-stack on a Zeiss 780 
confocal microscope. Nuclei surrounded by insulin or only glucagon were counted 
manually using Fiji cell counter. 

To quantify circulating glucagon level, we collected serum from the caudal tail 
vein of fish that were approximately 2-years-old and had fasted for 24h (n= 12 for 
each population). The serum was used for a glucagon radioimmunoassay according 
to the manufacturer’s protocol (MGL-32K; Millipore). To quantify circulating insu- 
lin levels, we collected plasma from the caudal tail vein of 2-year-old and 1-year-old 
fish and blotted the serum onto a nitrocellulose membrane using a Bio-dot sf device 
(Bio-Rad). Ponceau protein staining was used to verify equal protein loading, after 
which insulin was probed using anti-insulin (DAKO). Quantification of insulin 
levels was done using Fiji. 

Insulin-binding experiment. The full-length A. mexicanus insra protein-coding 
sequence was amplified from surface-fish and Tinaja-cavefish cDNA and cloned 
into a modified pcDNA3.1/Hygro vector providing an N-terminal Flag epitope 
tag**. To generate stable cell lines, the Flag-tagged insra cassettes were cloned into 
pcDNA5/ERT vector (Invitrogen, catalogue number V601020) allowing for Flp 
recombinase-mediated integration into the Flp-In-293 cell line according to the 
manufacturer's procedures. The FLP-In-293 cell line was originally purchased from 
Invitrogen/Thermo Fisher Scientific (catalogue number R75007), then cell-banked 
internally at the Stowers Institute. Cell line authentication was performed by 
Promega/ATCC, using short tandem repeat (STR) profiling. Cell lines were tested 
for mycoplasma contamination using the Universal Mycoplasma Detection Kit, 
ATCC 30-1012. One positive clone from each surface-fish and Tinaja-cavefish cell 
line was selected and used for the insulin-binding assays using a protocol derived 
from a previously published protocol™. In brief, 100-mm plates were seeded at 
30% confluency and cultured in DMEM, 10% FBS +1 Glx medium for 48h. 
The plates were then changed to insulin-free FreeStyle 293 Expression Medium 
(catalogue number 12338018) and incubated for an additional 24h. Plates at 
~70-80% confluency were pre-chilled for 30 min at 4°C and the medium was 
replaced with 5 ml of cold FreeStyle 293 Expression Medium containing 42 mM 
HEPES pH7.5 and human FITC-labelled insulin (Sigma, catalogue number 13661) 
at final concentrations of 0, 0.1, 0.2, 0.4, 0.8, 1 and 3|1M. For binding competition, 
10\1M of unlabelled human insulin (Sigma, catalogue number 19278) and 11M 
of human FITC-labelled insulin were added. After one hour of incubation in the 
dark at 4°C, the medium was removed and the plates were washed with 5 ml of 
cold 1x PBS. Cells were dissociated in 2 ml of 0.5 mM EDTA in PBS at 37°C for 
7 min, transferred into Eppendorf tubes, pelleted for 5 min at 200g at 4°C and 
resuspended in 1 ml of cold 1x PBS. To stain dead cells, 1 tl of Fixable Viability Dye 
(FVD eFluor450, Invitrogen, catalogue number 65-0863) was added to each 1 ml of 
cell suspension and incubated on ice in the dark for 20 min. Subsequently, the cells 
were washed once with cold 1x PBS and fixed in 1 ml of 4% formaldehyde. After 
two more washes with cold 1x PBS, the cells were resuspended in 15011 of cold 
1x PBS, filtered through a 70-\1m cell strainer (Filcons 070-67-S) and trans- 
ferred into a round-bottom 96-well plate. Binding data were acquired on an 
ImagestreamX MarkII (EMD Millipore) at 40x. FITC was excited with 150 mW 
488 nm. Fixable live/dead was excited with 12 mW 405 nm. Single colour con- 
trols were used for colour compensation. Bright field was collected on different 
channels. Analysis was performed in IDEAS v.6.2 and fluorescence intensity was 
reported as integrated intensity within an adaptive erode mask for bright field. 
Genome editing in zebrafish using the CRISPR-Cas9 technology. We intro- 
duced the cavefish P211L coding alteration into D. rerio by causing a precise C632T 
base pair exchange on the third exon of the insra gene (Extended Data Fig. 7). We 
designed the guide RNA target site using the web tool of the MIT Zhang labora- 
tory (http://crispr.mit.edu). We then validated the target region and checked for 
naturally occurring SNPs by PCR and sequencing of genomic DNA using the 
following oligonucleotide forward, reverse and sequencing primers, respectively: 
(1) TGAGGTGTGTCGAGTGTTCT; (2) TGTGCGTGTTCGTTGAGTGT, 
product size of 598 bp; and (3) CAGCCCTTGAAGGTGTAGAA. The PCR reac- 
tion components and procedures followed that described in ‘insra P211L geno- 
typing in A. mexicanus’. 

We used Cas9 protein from PNABio and 2-part Alt-R guide RNAs from IDT. 
Single-stranded oligodeoxynucleotides (ssODNs) were ordered as ultramers from 
IDT for generating the SNP mutations. The ssODNs included 100 bp of homology 
arms as well as silent SNP mutations in the guide RNA target site to prevent 
re-annealing of the guide RNA following homologous recombination. We also 
protected both ends of the ssODNs with three phosphorothioate bonds to inhibit 
exonuclease degradation in the cell. To form the guide RNA complex, we first 
annealed the specific CRISPR RNA (crRNA) with trans-activating crRNA (11M), 
followed by hybridization with the Cas9 protein (6.6 |1M) to form a ribonucleo- 
protein complex. This mixture was incubated for 10 min. Next, we combined the 
ribonucleoprotein complex with the ssODNs (50 ng/1) to form a total volume of 
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15,11 of injection mix, which was then injected into the cell at the 1-cell stage in at 
least 200 and up to 1,200 zebrafish embryos. 

insra P211L genotyping in zebrafish. We designed a genotypic screening assay to 
confirm the expected location of the P211L SNP mutation at various points through 
development. First, after injections we divided the embryos into two groups; group 
1 to screen at the embryo stage and group 2 to raise if the mutation was detected 
in group 1. Out of the 93 embryos screened, 34 fish exhibited a mixture of inser- 
tion and deletion mutations (35.5%), and at least 1 fish showed the exact SNP 
mutation (1.1%). Once these fish matured, we then identified adult mosaic indi- 
viduals. Out of the 64 screened Fo adult fish, 17 fish carried the intended P211L 
SNP mutation with the corresponding silent SNP changes (26.6%), but only 5 fish 
showed germline transmission (7.81%) when outcrossed to wild-type zebrafish. 
Next, we genotyped the F; embryos to calculate the germline transmission rate 
of each Fy mosaic founder. The germline transmission rate ranged from 8.3% to 
45.8% in the 5 Fp fish that showed positive germline transmission. We then raised 
the progenies of these mosaic founders. Lastly, we genotyped these F; progenies 
upon maturity and maintained the line by outcrossing heterozygous mutants to 
wild-type fish in each subsequent generation. To test whether the P211L muta- 
tion contributes to zebrafish weight, we paired heterozygous fish with successful 
germline transmission and weighed their progeny shortly before maturity (55 days 
post fertilization) to avoid gonadal effects on weight. 

Quantification of advanced glycation end-products. We used Oxiselect 
Advanced Glycation End Product (AGE) Competitive ELISA Kit according to the 
manufacturer’s protocol to measure AGE level in serum from two-year-old fish 
that were fasted for three days. 

Statistics and figure preparation. Graphical data and statistics were produced 
using R* and ggplot2*° package. We used Shapiro-Wilk to test for normality. We 
tested significance in normally distributed data using a t-test and non-parametric 
data using Mann-Whitney U-test. For non-parametric data with multiple ties we 
used a t-test. For comparison of more than two groups we used a one-way ANOVA 
with Tukey’s HSD post hoc test. 
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Animal experiment statement. Research and animal care were approved by the 
Institutional Animal Care and Use Committees of the laboratories involved. 
Data availability. Original data underlying this manuscript can be accessed 
from the Stowers Original Data Repository at http://www.stowers.org/research/ 
publications/libpb-1205 and/or are available from the corresponding authors on 
reasonable request. 
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Extended Data Figure 1 | Numbers of insulin- and glucagon-positive in surface and Tinaja larvae at 10-11 days post fertilization (n =5 fish 
cells in the developing pancreas are unchanged in Tinaja cavefish per population,). c, Average number of glucagon- and insulin-positive 
relative to surface fish. a, Whole-mount immunohistochemical cells, fish length, ratio of insulin to glucagon positive cells and P value 
detection of insulin- and glucagon-positive cells in Tinaja larvae at 10 comparing the surface and Tinaja values (determined using Student's 


days post fertilization. b, Number of glucagon- and insulin-positive cells t-test). 
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Extended Data Figure 2 | Serum glucagon levels are comparable respectively. P= 0.52, one-way ANOVA. Box plots show 25th, 50th and 
between the different populations. Box plot comparing serum glucagon 75th percentiles (horizontal bars), and 1.5x interquartile ranges (error 
levels between surface, Tinaja, Molino and Pachon fish after 24-h fast. bars), dots represent outliers. 


n= 12 fish per population, average of 57.87, 59.76, 79.66 and 48.89 
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Extended Data Figure 3 | Serum insulin levels are comparable between AU, artificial units; median, 25th, 50th and 75th percentiles (horizontal 
surface and Tinaja fish. a, Serum blotted onto nitrocellulose membrane bars) and error bars at 1.5 x interquartile ranges. Tinaja cavefish insulin 
using Bio-Dot SF microfiltration apparatus (Bio-Rad, catalogue number levels (mean = 10,770) tended to be higher than those of surface fish 
1706542) probed with anti-insulin antibody (DAKO). Each blot represents (mean =7,194) but the results are not significant (P = 0.057, Student’s 
an individual fish between 1- and 2-years-old (n= 24 fish per population). two-sample t-test). 
b, Quantification of insulin level measured by densitometry of blots. 
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Extended Data Figure 4 | Insulin decreases blood glucose level 
in surface fish. a, We injected different concentrations of human 
recombinant insulin (Sigma, product 19278, stock 9.5-11.5mg ml~ ‘) into 
the intraperitoneal cavity of surface fish to determine the effective dosage 
for subsequent experiments. Blood glucose levels are significantly lower 
after injection of approximately 0.6 or 0.06 g insulin per mg of fish weight 
compared to 0.0006 g (30 min after injection, n = 4 fish per dosage, dots 
represent individual fish, significance calculated using one-way ANOVA 
with Tukey’s HSD post hoc test, *P < 0.05). We used 0.06 g insulin per 
mg of fish in subsequent experiments. b, Blood glucose levels of surface 
fish over time, after injection of PBS or insulin. Blood glucose levels are 
significantly lower at 60 and 90 min compared to 15 min after insulin 
injection (n= 10 fish per time point and condition, significance calculated 
using one-way ANOVA with Tukey’s HSD post hoc test, *P < 0.05). 
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Therefore, we focused on the 60-min time point for comparisons with 
Tinaja cavefish. c, Blood glucose levels at 15 and 60 min after insulin 
injection in surface fish and Tinaja cavefish. Surface fish display a 
significant decrease in blood glucose levels, whereas cavefish display a 
significant increase in blood glucose levels (significance calculated using 
two-tailed Student's t-test, *P < 0.05, ***P < 0.0005). Tinaja cavefish 
blood glucose levels may increase owing to the stress of being injected; 
stress hormones, such as catecholamines, ACTH and epinephrine, cause 
transient increases in blood glucose in humans and mice, an effect 

that cannot be mitigated in the absence of insulin signalling. Although 
both cavefish and surface fish probably undergo a stress response upon 
injection, this is overcome in the surface fish, which have wild-type 
insulin activity, but not in the Tinaja cavefish, which have reduced insulin 
signalling. 
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Extended Data Figure 5 | An elevated fasting blood glucose level 24h after feeding. All of the fish with elevated blood glucose (greater than 
correlates with the presence of the P211L allele in F, hybrids. Blood 60mg di!) carry the P211L allele. Median, 25th, 50th and 75th percentiles 
glucose levels of 192 F, surface-Tinaja hybrids of the indicated genotype (horizontal bars), and error bars at 1.5 interquartile ranges. 
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Extended Data Figure 6 | Egg mass is a confounding variable in female fish. 
a, Image of an F, surface-Tinaja hybrid female and removed gonad, of indicated 
weights. b, Histogram displaying per cent gonad weight (gonad weight/total 
weight, multiplied by 100) of 62 F, surface-Tinaja hybrid females fed 6 mg 

per day for 4 months (minimum = 3.57, Ist quartile = 10.64, median = 13.96, 
mean = 13.96, 3rd quartile = 17.57 and maximum = 41.86). 
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GenomicDNA§’—s... TGTGTCCATCCTCATGCAAGCACCGCGCCTGC... 3" 
3° .,.,ACACAGGTAGGAGTACGTTCGIGGCGCGGACG... 5’ 


ssODN 3’ G*G*A*AATTTACATTGGGGACAAAGAGATAAAGAAGGGTGTCACACAGATAGTAGAACATTCGTGTCTCGGACGTGATTTCTGTTGGTCACGACGGTGC*T*G*G 5 


Extended Data Figure 7 | Genome editing strategy. CRISPR-Cas9 The intended SNP exchanges are underlined, and the specific C632T to 
mediated genome editing strategy in exon 3 of the insulin receptor a(insra) alter the P211 to L is denoted with a star. Both ends of the ssODN are 
zebrafish gene. The guide RNA target sequence is emphasized in bold in protected by three phosphorothioate bonds, denoted with asterisks. 
both the reverse strand of the wild-type genomic DNA and in the ssODN. The protospacer adjacent motif is shown in orange. 
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Extended Data Figure 8 | Scale growth is impaired in the insra zebrafish _ of the ventral fin by gentle scraping with a scalpel. Wild type (P), n=4, 
mutant. a, Quantification of scale size in zebrafish of the indicated P211L mutant (L), n =3. b, c, Representative images of scales stained with 
genotype; each point represents the mean scale size of an individual fish 0.005% calcein with contrast and brightness adjusted to show scale edges. 
based on the measurement of 10-14 scales removed from the left side of Significance calculated using two-way t-test of mean values, *P < 0.05. 


the body from the posterior edge of the dorsal fin to the posterior edge 
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Reconstructing the genetic history of late 


Neanderthals 


Mateja Hajdinjak!, Qiaomei Fu***, Alexander Hiibner!, Martin Petr!, Fabrizio Mafessoni', Steffi Grote!, Pontus Skoglund®, 
Vagheesh Narasimham‘, Héleéne Rougier®, Isabelle Crevecoeur’, Patrick Semal®, Marie Soressi?!, Sahra Talamo"®, 
Jean-Jacques Hublin!®, Ivan Gusié!', Zeljko Kuéan", Pavao Rudan", Liubov V. Golovanova!’, Vladimir B. Doronichev”, 
Cosimo Posth!*:4, Johannes Krause!*"", Petra Korlevic¢!, Sarah Nagel!, Birgit Nickel!, Montgomery Slatkin’, Nick Patterson>"®, 
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Although it has previously been shown that Neanderthals 
contributed DNA to modern humans!??, not much is known 
about the genetic diversity of Neanderthals or the relationship 
between late Neanderthal populations at the time at which their 
last interactions with early modern humans occurred and before 
they eventually disappeared. Our ability to retrieve DNA from a 
larger number of Neanderthal individuals has been limited by 
poor preservation of endogenous DNA? and contamination of 
Neanderthal skeletal remains by large amounts of microbial and 
present-day human DNA?>. Here we use hypochlorite treatment® 
of as little as 9 mg of bone or tooth powder to generate between 
1- and 2.7-fold genomic coverage of five Neanderthals who lived 
around 39,000 to 47,000 years ago (that is, late Neanderthals), 
thereby doubling the number of Neanderthals for which genome 
sequences are available. Genetic similarity among late Neanderthals 
is well predicted by their geographical location, and comparison to 
the genome of an older Neanderthal from the Caucasus”” indicates 
that a population turnover is likely to have occurred, either in the 
Caucasus or throughout Europe, towards the end of Neanderthal 
history. We find that the bulk of Neanderthal gene flow into early 
modern humans originated from one or more source populations 
that diverged from the Neanderthals that were studied here at least 
70,000 years ago, but after they split from a previously sequenced 
Neanderthal from Siberia” around 150,000 years ago. Although four 
of the Neanderthals studied here post-date the putative arrival of 
early modern humans into Europe, we do not detect any recent gene 
flow from early modern humans in their ancestry. 

The Middle to Upper Palaeolithic transition in Europe was char- 
acterized by major cultural and biological changes that coincided 
with the arrival of early modern humans and the disappearance of 
Neanderthals*’. Analyses of the first Neanderthal genomes have pro- 
vided evidence of gene flow from Neanderthals into modern humans 
between 50,000 and 60,000 years ago (ka), resulting in around 2% 
of Neanderthal DNA in the genomes of non-Africans today!”!”. 
Additionally, genetic analyses of an approximately 39,000-42,000-year- 
old modern human from Romania (Oase 1) showed that interbreeding 
between Neanderthals and modern humans also happened in Europe 
at a later point in time!!. However, little is known about the diversity 
of late Neanderthal populations across Europe and western Asia 
shortly before their disappearance, or about their relationship to 


the population that admixed with early modern humans. To date, only 
a handful of Neanderthal remains have been identified with a suffi- 
ciently high content of endogenous DNA and low enough levels of 
microbial and human DNA contamination? to allow analysis of larger 
parts of their genomes’, limiting our ability to study their genetic 
history. 

In an attempt to make more Neanderthal genomes available for pop- 
ulation analyses, we identified five specimens with sufficient preserva- 
tion of endogenous DNA to explore the possibility of nuclear genome 
sequencing (Fig. la and Supplementary Information 1): a fragment of 
a right femur (Goyet Q56-1) dated to 43,000-42,080 calibrated years 
(cal. yr) before present (BP; taken to be AD 1950) from the Troisiéme 
caverne of Goyet in Belgium’; an upper right molar (Spy 94a) that is 
associated with a maxillary fragment dated to 39,150-37,880 cal. yr BP 
from the neighbouring Spy cave in Belgium!; a tooth (Les Cottés 
ZA-1514) dated to 43,740-42,720 cal. yr BP from Les Cottés cave!4 in 
France; an undiagnosed bone fragment found in Vindija cave in Croatia 
(Vindija 87) dated to be older than 44,000 uncalibrated years Bp; and a 
skull fragment of an infant from Mezmaiskaya cave (Mezmaiskaya 2) 
in the Russian Caucasus dated to 44,600-42,960 cal. yr Bp! 

We extracted DNA from between 9 and 58 mg of bone or tooth 
powder. Approximately half of the powder from each specimen was 
treated with 0.5% hypochlorite solution before DNA extraction to 
remove present-day human and microbial DNA contamination’®. 
Hypochlorite treatment increased the proportion of DNA fragments 
mapping to the human reference genome between 5.6- and 161-fold 
(Fig. 1b and Supplementary Information 2), and reduced present-day 
human contamination in four of the specimens between 2- and 18-fold 
(Fig. 1c and Supplementary Information 2). This substantial increase in 
the proportion of informative fragments made whole-genome sequenc- 
ing of these previously inaccessible specimens feasible. 

We generated additional single-stranded DNA libraries from 
selected extracts and sequenced them to an average genomic cover- 
age of 2.7-fold for Les Cottés Z4-1514, 2.2-fold for Goyet Q56-1, 1.7- 
fold for Mezmaiskaya 2, 1.3-fold for Vindija 87 and 1-fold for Spy 94a 
(Extended Data Table 1 and Supplementary Information 3). Estimates 
of mitochondrial (mt) and nuclear DNA contamination ranged from 
0.52% to 5.06% and from 0.18% to 1.75%, respectively (Supplementary 
Information 4). Therefore, we restricted analyses to fragments carrying 
cytosine (C) to thymine (T) substitutions at their ends as these derive 
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Les Cottés Goyet Mezmaiskaya Vindija 
Q56-1 2 87 
Figure 1 | Specimen information and the effects of 0.5% hypochlorite 
treatment. a, Location and age of the five late Neanderthal specimens 
analysed in this study (new), and other sites for which genome-wide data 
of Neanderthal samples have previously been published (old). Map source: 
Vectormapcollection. b, Proportion of DNA fragments aligned to the 
human reference genome in untreated bone and tooth powder and in 


from deamination of cytosine to uracil and indicate that DNA mole- 
cules are of ancient origin'®!” (Extended Data Figs 1, 2). This reduced 
the mtDNA contamination to 0.39-1.61% and the autosomal contami- 
nation to 0-0.81% (Supplementary Information 4). To mitigate the 
influence of deamination on genetic inferences, we further restricted 
the analyses to only transversion polymorphisms (see Supplementary 
Information 6). 

A phylogenetic tree (Fig. 2a) of the reconstructed complete mito- 
chondrial genomes places these five specimens within the Neanderthal 
mtDNA variation. The relationship of the mtDNA of Les Cottés 
Z4-1514 to the mtDNAs of Neanderthals from across their entire geo- 
graphic range—including older Neanderthals from Okladnikov and 
Denisova cave (Supplementary Information 5)—together with the 
grouping of Mezmaiskaya 2 with late Neanderthals, including Feldhofer 
2, challenges the previously proposed division between Eastern and 
Western mtDNAs in late-surviving Neanderthals’. As inferred from 
the sequence coverage of the X chromosome and the autosomes, Goyet 
Q56-1, Les Cottés Z4-1514 and Vindija 87 were females, whereas 
Mezmaiskaya 2 and Spy 94a were males (Extended Data Fig. 3). The 
Y chromosome sequences of both male individuals fall outside the 
known variation of present-day human Y chromosomes (Fig. 2b and 
Supplementary Information 5), as is the case for the Y chromosome of 
a Neanderthal from El Sidrén, Spain’. 

We analysed the genomes of the five late Neanderthals with the 
previously published high-quality genomes of an approximately 
120,000-year-old Neanderthal from Siberia (Altai Neanderthal)” and 
Neanderthal from Croatia (Vindija 33.19) that is more than 45,000 
years old’, the low-coverage genome of a Neanderthal who lived 
approximately 60-70 ka from Mezmaiskaya cave (Mezmaiskaya 1)”, 
the composite low-coverage genome of three Neanderthals from 
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powder treated with 0.5% sodium hypochlorite (Supplementary Table 2.1). 
c, Proportion of present-day human contamination (with 95% binomial 
confidence intervals) inferred from mtDNA fragments in treated and 
untreated samples. Two-sided Pearson's \” tests were used to calculate 
significant differences (**a < 0.001) (Supplementary Table 2.5). 


Croatia’, the high-coverage genome of a Denisovan individual” and 
a world-wide panel of present-day humans*”!. On the basis of derived 
alleles that are shared with the Vindija 33.19 genome’, we find that the 
Vindija 87 specimen originates from the same individual as Vindija 
33.19. We therefore excluded Vindija 87 from subsequent analyses 
(Supplementary Information 7). 

A neighbour-joining tree based on the number of pairwise trans- 
versions between individuals shows that all Neanderthals form a 
monophyletic clade relative to the Denisovan individual (Fig. 2c and 
Supplementary Information 7, 8). The tree reflects an apparent age- 
related division among the Neanderthals with the oldest specimen, 
the Altai Neanderthal branching off first, followed by Mezmaiskaya 1, 
whereas the late Neanderthals form a clade. To assess the relationship 
of the late Neanderthals to the high-coverage genomes of the Altai and 
Vindija 33.19 Neanderthals, we used two related statistics that meas- 
ure the fraction of derived allele-sharing between the genomes. We 
find that all late Neanderthals and Mezmaiskaya 1 share significantly 
more derived alleles with Vindija 33.19 than with the Altai Neanderthal 
(—32.1< Z< —58.1; Extended Data Table 2 and Supplementary 
Information 9) and that late Neanderthals share on average 49.0% 
(95% confidence interval, 44.2-54.2%) of the derived alleles seen in 
Vindija 33.19 and not in other high-coverage genomes in the analysis, 
whereas they share 17.2% (95% confidence interval, 15.9-18.3%) of the 
derived alleles in the Altai Neanderthal (Extended Data Table 3 and 
Supplementary Information 7). 

Obtaining genome-wide data of multiple late Neanderthals from a 
broad geographical range enabled us to determine whether related- 
ness among Neanderthals is correlated to their geographical proximity, 
as is the case for present-day humans”. In support of this, we find 
that the two Neanderthals from Belgium share more derived alleles 
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Figure 2 | Phylogenetic relationships of late Neanderthals. a, Bayesian 
phylogenetic tree of mitochondrial genomes of 23 Neanderthals, 

3 Denisovans, 64 modern humans and a hominin from Sima de los 
Huesos. The posterior probabilities for the branches are shown. 

b, Neighbour-joining tree of Y chromosome sequences of Mezmaiskaya 
2, Spy 94a, 175 present-day humans”! and two present-day humans 


with each other than with any other Neanderthal (—3.65 < Z < —8.47; 
Supplementary Information 9), and in turn more derived alleles with 
Neanderthals from France and Croatia than with the late Neanderthal 
from the Caucasus. Similarly, the four Neanderthals from Vindija cave 
that come from a relatively narrow time range share more derived 
alleles with each other than with other Neanderthals (—2.2<Z< 
—14.5; Supplementary Information 9). Furthermore, specimens with 
a similar age and the largest geographical distance between them 
(Les Cottés Z4-1514 and Mezmaiskaya 2) shared the fewest derived 
alleles (Supplementary Information 9). By contrast, Mezmaiskaya 2 
shared more derived alleles with the other late Neanderthals than with 
Mezmaiskaya 1 (—2.13 < Z < —9.56; Supplementary Information 9), 
suggesting that there was a population turnover towards the end of 
Neanderthal history. This turnover may have been the result of a pop- 
ulation related to western Neanderthals replacing earlier Neanderthals 
in the Caucasus, or the replacement of Neanderthals in western Europe 
by a population related to Mezmaiskaya 2. The timing of this turnover 
coincides with pronounced climatic fluctuations during Marine Isotope 
Stage 3 between 60 and 24ka”’, when extreme cold periods in northern 
Europe may have triggered the local extinction of Neanderthal popula- 
tions and subsequent re-colonization from refugia in southern Europe 
or western Asia”**>. 


654 | NATURE | VOL 555 | 29 MARCH 2018 


b Y-chromosomal DNA 


48 


Present-day 
modern humans 


c Nuclear DNA 


100 Present-day 
modern humans 


carrying the A00 haplogroup*’. The number of substitutions is shown 
above the branches. c, Neighbour-joining tree of nuclear genomes based 
on autosomal transversions among late Neanderthals, Vindija 33.19, 
Mezmaiskaya 1, Altai Neanderthal, Denisovan and 12 present-day 
humans. Bootstrap support values after 1,000 replications are shown. 


We estimated the population split times between each of the low- 
coverage Neanderthal genomes and the two high-coverage Neanderthal 
genomes by determining the fraction of sites at which each of the 
low-coverage Neanderthal genomes shares a derived allele that occurs 
in the heterozygote state in one of the high-quality genomes (F(A|B) 
statistics'”°). This fraction was then used to estimate the population 
split times for each pair of Neanderthals using previous inferences of 
how Neanderthal population sizes changed over time’. Owing to the 
uncertainties in the mutation rate and generation times, we caution 
that although the times presented are likely to accurately reflect the 
relative ages of the population split times, the absolute estimates in 
years are approximate. We estimate that the late Neanderthals studied 
here separated from a common ancestor with the Altai Neanderthal 
around 150ka (95% confidence interval, 142-186 ka), and from a com- 
mon ancestor with Vindija 33.19 about 70 ka (95% confidence interval, 
58-72 ka; Extended Data Table 4 and Supplementary Information 8). 
The estimates of the population split times from the common ancestors 
shared with the Denisovan and with modern humans are around 400ka 
(95% confidence interval, 367-484 ka) and about 530 ka (95% confi- 
dence interval, 503-565 ka; Extended Data Table 4 and Supplementary 
Information 8), respectively, consistent with previous estimates using 
the Altai and Vindija 33.19 Neanderthal genomes!”°. 
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Figure 3 | Proximity to the introgressing Neanderthal populations in 
present-day and ancient humans calculated using D(Neanderthall1, 
Neanderthal2; non-African, African). Three Mbuti individuals from the 
Simons Genome Diversity Project (SGDP)! were used as an outgroup 
and standard errors were calculated using a weighted block jackknife 
(Supplementary Information 10). Shaded grey region corresponds 


To investigate whether any of the Neanderthals sequenced to date 
is more closely related to the Neanderthal population that contributed 
genetic material to modern humans, we compared the Neanderthal 
genomes from this and previous studies to the genomes of 263 present- 
day humans”! as well as a number of early modern humans!®!1?67, 
We find that all late Neanderthals and the older Mezmaiskaya 1 
Neanderthal share significantly more derived alleles with the intro- 
gressing Neanderthals than the Altai Neanderthal does (—2.4<Z< 
—5.6; Fig. 3a and Supplementary Information 10), with no significant 
differences among them (—0.1 <Z< 1.8; Fig. 3b and Supplementary 
Information 10). Interestingly, this is also true for an approximately 
45,000-year-old modern human from Siberia (Ust -Ishim) (Fig. 3 and 
Supplementary Information 10), who was contemporaneous with late 
Neanderthals, but is not a direct ancestor of present-day humans!®, 
Thus, the majority of gene flow into early modern humans appears 
to have originated from one or more Neanderthal populations that 
diverged from other late Neanderthals after their split from the Altai 
Neanderthal about 150 ka, but before the split from Mezmaiskaya 1 at 
least 90 ka (Extended Data Table 4). Owing to the scarcity of overlap- 
ping genetic data from Oase 1, whose genome revealed an unusually 
high percentage of Neanderthal ancestry!!, we were unable to resolve 
whether one of these late Neanderthals was significantly closer than 
others to the introgressing Neanderthal in Oase 1. 

Interbreeding between Neanderthals and early modern humans is 
likely to have occurred intermittently, presumably resulting in gene 
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to D<0.a, All late Neanderthals and the older Mezmaiskaya 1 are 
significantly closer to the introgressing Neanderthal population(s) than 
the Altai Neanderthal. b, There is no significant difference between late 
Neanderthals, Mezmaiskaya 1 and Vindija 33.19 in their proximity to 
the introgressing Neanderthal population(s) in present-day and ancient 
humans. 


flow in both directions”*. However, when we applied an approach that 
uses the extended length of haplotypes expected from recent intro- 
gression into the analysed late Neanderthals, we did not find any 
indications of recent gene flow from early modern humans to the late 
Neanderthals (Supplementary Information 11). We caution that given 
the small number of analysed Neanderthals we cannot exclude that 
such gene flow occurred. However, it is striking that Oase 1, one of two 
early modern humans that overlapped in time with late Neanderthals, 
showed evidence for recent additional Neanderthal introgression'®"! 
whereas none of the late Neanderthals analysed here do. This may 
indicate that gene flow affected the ancestry of modern human 
populations more than it did Neanderthals”. Further work is necessary 
to determine whether this was the case. Our work demonstrates that 
the generation of genome sequences from a large number of archaic 
human individuals is now technically feasible, and opens up the pos- 
sibility to study Neanderthal populations across their temporal and 
geographical range. 

Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized and the investigators were not blinded to 
allocation during experiments and outcome assessment. 

DNA extraction and library preparation. All specimens were sampled in clean 
room facilities dedicated to the analysis of ancient DNA. Between 28 mg and 
104 mg of tooth or bone powder was obtained by drilling once into the physi- 
cally cleaned part of the specimen and split evenly (Supplementary Information 
2; Supplementary Table 2.1). Approximately half of the powder was directly sub- 
jected to DNA extraction using a silica-based method?! as implemented in ref. 6 
(the untreated sample), whereas the second half was treated with a 0.5% sodium 
hypochlorite solution® before DNA extraction in an attempt to remove some of the 
microbial and present-day human DNA contamination* *. Subsequently, 5 or 10 ji 
of each extract was converted into single-stranded DNA libraries*” with the modi- 
fications as in ref. 6. The initial libraries of Vindija 87, Goyet Q56-1 and Les Cottés 
ZA4-1514 were treated with Escherichia coli uracil-DNA-glycosylase (UDG) and 
E. coli endonuclease VIII (Endo VIII)'>” to excise uracils, while all other libraries 
were prepared without this enzymatic treatment (Supplementary Information 2). 
The libraries were amplified into plateau*’ and tagged with two sample-specific 
indices®**. An aliquot of each amplified library was additionally enriched for 
hominin mtDNA using a bead-based hybridization method and modern human 
mtDNA as a bait*>~*”. After analysing the first set of libraries, we selected the 
extracts with the highest proportion of endogenous DNA and the lowest levels of 
present-day human DNA contamination to produce the final set of single-stranded 
DNA libraries®” (Supplementary Information 2 and Supplementary Table 2.6). 
Genome sequencing and data processing. All libraries were initially sequenced 
together on Illumina’s MiSeq and HiSeq 2500 platforms to determine their 
suitability for whole-genome sequencing. Subsequently, 23 libraries from five 
Neanderthal specimens were selected and sequenced on 50 lanes of the Illumina 
HiSeq 2500 platform in rapid mode, using double index configuration (2 76 bp)** 
(Supplementary Information 2). Base calling was done using Bustard (Illumina) 
for the MiSeq runs and Freelbis** for the HiSeq runs. Adapters were trimmed 
and overlapping paired-end reads were merged into single sequences using 
leeHom”’. The Burrows—Wheeler aligner (BWA, version 0.5.10-evan.9-1- 
g44db244; https://github.com/mpieva/network-aware-bwa)" was used to align the 
shotgun data to the modified human reference GRCh37 (ftp://ftp.1000genomes. 
ebi.ac.uk/vol1/ftp/technical/reference/phase2_reference_assembly_sequence/) and 
to align the human mtDNA capture data to the revised Cambridge Reference 
Sequence (NC_01290) with parameters adjusted for ancient DNA (-n 0.01 -0 2 -1 
16500)". We developed a two-step algorithm called jivebunny (https://bioinf.eva. 
mpg.de/jivebunny) for de-multiplexing of the sequencing runs (Supplementary 
Information 3), retaining only fragments that were assigned to the correct 
library based on their index sequences for all of the downstream analyses. PCR 
duplicates were removed using bam-rmdup (version 0.6.3; https://github.com/ 
mpieva/biohazard-tools/) and fragments were filtered for read length (>35 bp) 
and mapping quality (MQ > 25) using SAMtools (version 1.3.1)4". 

The deamination of cytosine (C) to uracil (U) residues leaves characteristic 
C-to-T substitution patterns in ancient DNA molecules, which are particularly close 
to the alignment ends’° and thus provide evidence for the presence of authentic 
ancient DNA in specimens” (Supplementary Information 2, 3 and Extended 
Data Fig. 1). We evaluated the frequency of these nucleotide substitution patterns 
characteristic of ancient DNA using an in-house Perl script. For determining the 
coverage of the nuclear genomes, we counted the number of bases with a base 
quality of at least 30 (BQ > 30) in the fragments that overlapped highly mappa- 
ble regions of the autosomes of the human genome (Map35_100% of ref. 2) and 
divided this number by the total length of those regions. We determined the sex 
of the Neanderthal individuals by counting the number of fragments that aligned 
to the X chromosome and the autosomes (Extended Data Fig. 3). 
Contamination estimates. We used four methods to estimate the proportion 
of present-day human DNA contamination in the final dataset (Supplementary 
Information 4). We estimated the proportion of mtDNA contamination by 
present-day human DNA using two different sets of positions. First, we re-aligned 
the shotgun data of all HiSeq runs to the revised Cambridge Reference Sequence 
(rCRS, NC_012920) using BWA”. We then counted the number of fragments 
overlapping 63 positions at which 18 published Neanderthal mitochondrial 
genomes’”!?4346 differ from those of 311 present-day humans”. In the second 
approach, we determined positions in the reconstructed mtDNA genomes of each 
of the five Neanderthals that are specific for that Neanderthal compared to 311 
present-day humans. To mitigate the effect of deamination for both approaches, 
we ignored the alignments on the forward or reverse strands at positions where 
the informative base was a C ora G”’. 

We estimated the extent of present-day human DNA contamination in the 
autosomal sequences of the five late Neanderthals using a maximum likelihood 
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approach described in ref. 1, based on all sites covering informative positions in 
the nuclear genome at which humans carry a fixed derived variant when compared 
to the great apes. Additionally, we estimated levels of present-day human DNA 
contamination using an ancestry model in which each low-coverage Neanderthal 
traces a portion of the genome either from a high-coverage uncontaminated 
Neanderthal, or from a modern human population related to present-day non- 
Africans. We built a two-source qpAdm” model in which one part of the ancestry 
was modelled as being closely related to the high-coverage genome of Vindija 33.19 
Neanderthal, and the other source of ancestry was modelled as being most closely 
related to the Dinka population of the Simons Genome Diversity Project (SGDP)”!. 
We estimate the proportion of male contamination for the three female individ- 
uals, Les Cottés Z4-1514, Goyet Q56-1 and Vindija 87 by counting the number 
of fragments aligning to the unique regions of the Y chromosome and divided it 
with the number of fragments that would be expected if the individual was a male 
(Supplementary Information 4 and Supplementary Table 4.1). 

mtDNA and Y chromosomes of Neanderthals. We re-aligned the shotgun data 
of all HiSeq runs to the Vindija 33.16 mitochondrial genome (AM948965)*? using 
BWA“ to reconstruct complete mitochondrial genomes of five Neanderthals. 
A consensus base was called at each position covered by at least three fragments 
and where at least two-thirds of fragments had an identical base’” (Supplementary 
Information 5). We aligned the reconstructed mitochondrial genomes of the five 
late Neanderthals to the mtDNA genomes of 18 Neanderthals™!*43-4649, 311 
present-day humans”, 10 ancient modern humans*!°76°°-*, 3 Denisovans?>*, 
a hominin from Sima de los Huesos!” and a chimpanzee* using MAFFT*” and 
calculated the number of pairwise differences among mitochondrial genomes using 
MEGAO**, Bayesian phylogenetic analysis was performed using BEAST version 
2.4.5 to infer the time to the most recent common ancestor of all Neanderthal 
mitochondrial genomes (Supplementary Information 5). Radiocarbon dates of 
Neanderthals and ancient modern humans were used as calibration points for 
the molecular clock and we used jModelTest2” and marginal likelihood esti- 
mation analysis°!” to choose the best fitting substitution, clock and tree model 
(Supplementary Information 5). 

For reconstructing the phylogeny of the Neanderthal Y chromosomes, we 
compared the two male Neanderthal individuals, Mezmaiskaya 2 and Spy 94a, 
to 175 present-day human males from the SGDP”! and two present-day humans 
with haplogroup A00*°. For processing the Y chromosomal data of the presented 
Neanderthal individuals, we followed the processing of the previously published 
Neanderthal Y chromosome analysis!” and used the described parameters to call 
bases and infer genotypes (Supplementary Information 5). 

Set of filters for nuclear data analyses. As the data presented in this study were of 
low-coverage and contained elevated C-to-T substitutions, we investigated spuri- 
ous correlations stemming from the properties of the data using D-statistics'°*™ 
(Supplementary Information 6). All of the sampling schemes, except random read 
sampling and then restricting to transversion polymorphisms, as well as simula- 
tion of UDG treatment, resulted in significant spurious correlations between indi- 
viduals (Supplementary Information 6 and Supplementary Table 6.1). Therefore, 
we applied random read sampling at each position in the genome that was covered 
for the low-coverage individuals. To diminish the effect of present-day human 
DNA contamination and enrich for the endogenous fragments’, we further 
selected the fragments that showed C-to-T substitutions relative to the human 
reference genome at the first three and/or the last three positions, that is, putatively 
deaminated fragments. The newly generated sequencing data of Mezmaiskaya 1’, 
an approximately 60,000-70,000-year-old sample of a Neanderthal from Russia, 
were processed in the same way as the data of the other low-coverage individuals. 
For the high-coverage genomes of Altai” and Vindija 33.19” Neanderthals, as 
well as the Denisovan” individual, snpAD genotype calls (http://cdna.eva.mpg. 
de/neandertal/Vindija/VCF/) were used. For comparison to modern humans, 
we used hetfa files of 263 present-day human genomes of the SGDP”! and 
snpAD genotype calls of three high quality ancient modern humans, Ust’-Ishim, 
an approximately 45,000-year-old modern human from Siberia!®; Loschbour, 
an around 8,000-year-old hunter-gatherer from Luxembourg”*; and LBK, an 
approximately 7,000-year-old farmer from Stuttgart”®. Variant sites across all 
genomes were extracted (https://bioinf.eva.mpg.de/heffalump) and converted 
into an input format for AdmixTools (version 4.1)© or exported into combined 
VCF files. We further restricted all analyses to bi-allelic sites in the genome 
covered by at least one low-coverage Neanderthal genome and to transversion 
polymorphisms. 

Principal component analysis. We carried out principal component analysis 
using genomes of Vindija 33.197, the Altai Neanderthal’ and the Denisovan 
individual” to estimate the eigenvectors of the genetic variation and then projected 
the low-coverage Neanderthal genomes onto the defined plane. This allowed us 
to explore the relationship of low-coverage Neanderthal genomes relative to the 
high-coverage Neanderthal genomes (Extended Data Fig. 4). 
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Lineage attribution and average sequence divergence of late Neanderthals. We 
followed an approach described in ref. 66 based on the sharing of derived alleles 
with a certain hominin group to determine more precisely to which hominin group 
the nuclear genomes of Les Cottés Z4-1514, Goyet Q56-1, Spy 94a, Mezmaiskaya 
2 and Vindija 87 are most closely related. We investigated the state of DNA frag- 
ments overlapping the positions at which the high-coverage genomes of the Altai 
Neanderthal’, the Vindija 33.19 Neanderthal’, the Denisovan individual”® and a 
present-day African (Mbuti, HGDP00982)’ differ from those of the great apes 
(chimpanzee, bonobo, gorilla and orangutan). We then determined the proportion 
of fragments for each of the low-coverage Neanderthal genomes that supported 
the derived state of each of the branches in the phylogenetic tree relating the four 
high-coverage genomes® (Extended Data Table 3, Supplementary Information 7 
and Supplementary Table 7.1). 

We estimated the divergence of the five late Neanderthal genomes along the 

lineage from the ancestor shared with the chimpanzee and the high-coverage 
genomes of the Altai and Vindija 33.19 Neanderthals, the Denisovan individual, 
or a present-day human from the B-panel of ref. 2, using the triangulation method 
previously applied to a number of ancient genomes’*?"+°°, We calculated how 
many of the substitutions that were inferred to have occurred from the human- 
chimpanzee ancestor to the high-coverage genomes that occurred after the split 
from the low-coverage genome (Supplementary Information 7 and Supplementary 
Tables 7.2, $7.3). Standard errors were computed by a weighted block jackknife®” 
with a block size of 5 million base pairs (5 Mb) across all autosomes. 
Split times of late Neanderthals and the neighbour-joining tree of nuclear 
genomes. We conditioned on the heterozygous sites in the two high-coverage 
Neanderthals, and then computed the fraction of sites that show the same derived 
allele in randomly sampled fragments of the low-coverage individuals in order 
to estimate the split times between the low-coverage Neanderthals and the Altai’ 
and Vindija 33.19” Neanderthals (Extended Data Table 4 and Supplementary 
Information 8). In this F(A|B) statistich”°, the expected value depends only on 
the demography of the high-coverage individuals from which heterozygous sites 
are determined. We performed these analyses on all randomly sampled fragments, 
and on deaminated fragments only (Extended Data Table 4 and Supplementary 
Information 8). 

We used the low-coverage nuclear genomes of Les Cottés Z4-1514, Goyet Q56-1, 
Spy 94a, Mezmaiskaya 2 and Mezmaiskaya 1, the high-coverage genomes of 
Vindija 33.19, Altai Neanderthal, the Denisovan individual and 12 present-day 
humans from ref. 2 for the construction of a neighbour-joining tree. We counted 
the total number of transversions between all pairs of individuals and the human- 
chimpanzee common ancestor”. We constructed a neighbour-joining tree 
based on the pairwise number of transversions in windows of 5 Mb across all auto- 
somes between all individuals with 1,000 bootstrap replications (Supplementary 
Information 8). The tree was constructed as implemented in the R-package 
phangorn® and visualized with FigTree (version 1.4.2) (http://tree.bio.ed.ac.uk/ 
software/figtree/). 

Inferring the relationships between Neanderthals. We used D-statistics 
to investigate the population relationships among Neanderthal individuals. We 
co-analysed all low-coverage Neanderthal genomes with the high-coverage 
genomes of the Altai” and Vindija 33.19” Neanderthals. We used the whole-genome 
alignments of the chimpanzee, orangutan and rhesus macaque to the human 
reference genome”? to infer the ancestral states for the analyses of D(Altai, 
Vindija 33.19; Neanderthal, outgroup). Furthermore, we used the genomes of the 
Dinka and Mbuti individuals from the SGDP”' as outgroups for the statistics of 
D(Neanderthall, Neanderthal2; Neanderthal3, outgroup). The standard errors 
were computed using a weighted block jackknife®” with equally sized blocks of 
5 Mb over all autosomes. We further restricted these analyses to bi-allelic sites in 
the genome covered by at least one low-coverage Neanderthal and transversion 
polymorphisms. 

Inferring the relationship to the introgressed Neanderthals in present-day 
and ancient modern humans. We analysed the low-coverage late Neanderthal 
genomes together with the high-coverage genomes of the Altai” and Vindija 33.19 
Neanderthals’, the high-coverage genomes of the Denisovan individual” and 263 
present-day humans of the SGDP”! (Supplementary Information 10). We included 
Ust’-Ishim, an approximately 45,000-year-old modern human from Siberia’; 
Loschbour, an around 8,000-year-old hunter-gatherer from Luxembourg”®; and 
LBK, an around 7,000-year-old farmer from Stuttgart”® to study the differences 
among Neanderthals in their proximity to the introgressed Neanderthal DNA 
detected in ancient modern humans. Analyses were restricted to transversion 
polymorphisms and to bi-allelic sites in the genome covered by at least one 
low-coverage Neanderthal. As above, D-statistics were used to infer the rela- 
tionships between individuals'**® and standard errors were computed using a 
weighted block jackknife®*“” over all autosomes (block size: 5 Mb). 


1,54,63 


Early modern human gene flow into late Neanderthals. We modelled admixture 
from modern humans into Neanderthals with an ascertainment scheme in which 
both the Denisovan individual and Altai Neanderthal were fixed for the ancestral 
allele and at least half of the alleles in present-day African populations are derived. 
We applied the method as previously described’? and estimated the date of recent 
early modern human admixture into Neanderthals to be around 10-100 gene- 
rations ago. At each single-nucleotide polymorphism in the genome, we consi- 
dered data from all Yoruba individuals from the 1000 Genomes Project”* covered 
by at least three fragments that passed a pre-defined set of filters. Furthermore, 
we restricted the analysis to sites in the genome at which >24 Yoruba indivi- 
duals as well as the Altai Neanderthal and Denisovan individual had allele calls 
(Map35_50% filter from ref. 2). The ancestral states were taken from the inferred 
ancestor of humans and chimpanzees (Ensembl Compara version 64)’*”°. We 
introduced a more complex demographic history that is loosely based on the model 
described in ref. 77 (details in Supplementary Information 11). 

Code availability. All software packages used for analysis are cited and are pub- 
licly available. All custom developed software is freely available at https://bioinf. 
eva.mpg.de/. 

Data availability. The aligned sequences have been deposited in the European 
Nucleotide Archive under accession numbers PRJEB21870 (Goyet Q56-1), 
PRJEB21875 (Les Cottés Z4-1514), PRJEB21881 (Mezmaiskaya 2), PRJEB21882 
(Vindija 87) and PRJEB21883 (Spy 94a). The mitochondrial consensus sequences 
reported in this paper are available in GenBank with accession numbers 
MG025536, MG025537, MG025538, MG025539 and MG025540. 
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Extended Data Figure 1 | Frequency of nucleotide substitutions at the genome with a mapping quality of at least 25 (MQ > 25) were used for this 
beginning and the end of nuclear alignments for the final dataset of analysis. Solid lines depict all fragments and dashed lines the fragments 
Les Cottés Z4-1514, Goyet Q56-1, Mezmaiskaya 2, Vindija 87 and Spy that have a C-to-T substitution at the opposing end (‘conditional’ C-to-T 
94a. Only fragments of at least 35 bp that mapped to the human reference substitutions). All other types of substitutions are marked in grey. 
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Extended Data Figure 2 | Fragment size distribution of fragments 
longer than 35 bp mapped to the human reference genome with 


MQ > 25 for each of the five late Neanderthals. All fragments are dashed lines. 
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reference genome (putatively deaminated fragments) are depicted with 
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of fragments aligning to the X chromosome and the autosomes. The fragments (in red) and for deaminated fragments only (in grey). 
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Extended Data Figure 4 | Principal component analysis of the low-coverage Neanderthals were projected onto the plane. Only 
genomes of Vindija 33.19, Altai, the Denisovan individual, five late transversion polymorphisms and bi-allelic sites were considered for 
Neanderthals and Mezmaiskaya 1. Genomes of the high-coverage the analysis, to a total of 1,010,417 sites as defined by the high-coverage 


archaics were used to estimate the eigenvectors of the genetic variation and genomes. PC, principal component. 
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Extended Data Table 1 | Amount of data generated for Les Cottés 24-1514, Goyet Q56-1, Mezmaiskaya 2, Vindija 87 and Spy 94a 


Fragments with terminal C-to-T substitutions to the reference 
All fragments 
genome 
Specimen i i 
pecime Number of ‘ Number of mapped Number of unique Obtained Number of mapped Number of unique Obtained 
sequenced Number o — a bp, ae a bp, nuclear sige bp, a ee ity bp, ‘aiclear 
7 fragments > 35 b 225 225, > >25, 
fragments e F Map35_100% Map35_100% coverage Map35_100% Map35_ 100% coverage 
bre mis he 1,776,108,129 978,981,412 253,134,472 121,336,498 271 59,777,476 41,804,332 1.00 
Goyet Q56-1 917,798,995 575,597,226 124,582,445 80,515,019 2.18 13,585,646 9,859,280 0.27 
Mezmaikaya 2 2,571,128,011 1,356,302,134 125,815,832 74,407,074 1.74 32,838,281 23,521,855 0.56 
Vindija 87 1,353,546,508 678,005,556 110,385,510 60,034,976 1.27 32,597,516 22,828,056 0.49 
Spy 94a 733,212,063 278,595,622 72,962,863 46,836,553 1.00 14,301,192 10,003,560 0.22 
The number of fragments after merging all of the sequencing libraries together is shown. The coverage of the nuclear genomes is determined by counting the number of bases with a base quality of at 
east 30 in fragments 


onger than 35 bp with MQ > 25 that overlapped highly mappable regions of autosomes of the human genome, and dividing that number by the total length of these regions. 
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Extended Data Table 2 | Relationship of the late Neanderthals and Mezmaiskaya 1 to the Altai and Vindija 33.19 Neanderthals calculated 
as D(Altai, Vindija 33.19, Neanderthal, outgroup) for all fragments and deaminated fragments, restricted to transversions 


outgroup) 
Vindija33.19 | Ls a a 30,276 | 10,399 19,048 | 6,506 
Vindija33.19 | Ls a a 30,340 | 10,362 19,077 | 6,471 
Vindija33.19 | Ls rs oil 29,427 | 10,028 18,597 | 6,302 
Vindija33.19 | bes Cones 24- 28,149 | 9,664 17,775 | 6,060 
Vindija33.19 | Ls Sip ae 25,856 | 9,003 16,281 | 5,664 
Vindija33.19 | Goyet Q56-1 33,208 8,673 | 2,423 
Vindija33.19 | Goyet Q56-1 33,269 8,689 | 2,428 
Vindija33.19 | Goyet Q56-1 32,188 8,485 | 2,340 
Vindija33.19 | Goyet Q56-1 30,735 8,125 | 2,239 
Vindija33.19 | Goyet Q56-1 28,312 7,483 | 2,092 
Vindija33.19 | Mezmaiskayal 20,392 8,379 3,845 
Vindija33.19 | Mezmaiskayal 20,516 8,438 3,842 
Vindija33.19 | Mezmaiskayal 19,860 8,126 3,736 
Vindija33.19 | Mezmaiskayal 18,904 7,758 3,586 
Vindija33.19 | Mezmaiskayal 17,433 7,129 3,327 
Vindija33.19 | Mezmaiskaya2 26,802 12,477 4,516 
Vindija33.19 | Mezmaiskaya2 26,781 12,471 4,481 
Vindija33.19 | Mezmaiskaya2 26,019 12,233 4,369 
Vindija33.19 | Mezmaiskaya2 24,921 11,700 4,210 
Vindija33.19 | Mezmaiskaya2 22,908 10,799 3,929 
Vindija33.19 21,416 6,558 | 1,895 
Vindija33.19 21,453 6,567 | 1,878 
Vindija33.19 20,814 6,354 | 1,824 
Vindija33.19 19,793 6,052 | 1,740 
Vindija33.19 18,179 5,584 | 1,624 
All late Neanderthals and Mezmaiskaya 1 are significantly closer to Vindija 33.19 than to the Altai Neanderthal, irrespective of the outgroup used. Blue denotes Z<« — 2. A total of 1,567,449 sites that 


are transversions among Neanderthals and for which at least one late Neanderthal has coverage were used. Chimp, chimpanzee; Orang, orangutan; Rhesus, rhesus macaque. 
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Extended Data Table 3 | The fraction of derived alleles among putatively deaminated fragments that each of the low-coverage individuals 
shares with the Altai Neanderthal, Vindija 33.19, the Denisovan individual and a present-day human genome 


Altai zo ae , Neanderthal- Human- 4 
a seria Neanderthal sae - ay Denisova Neanderthal sa ai 
: : (%) : : (%) (%) i 
Les Cottés Z4- 0.67 93.01 18.00 46.41 0.86 97.90 97.30 B22) 
1514 0.65-0.70 92.89-93.13] 17.69-18.32 46.06-46.77] [0.82-0.89] 97.83-97.98] [97.17-97.43 [3.07-3.38] 
Goyet Q56-1 0.60 94.38 16.47 53.55 0.80 98.17 97.72 2,52 
y ‘ 0.56-0.65 94.19-94.57] 15.93-17.02 52.91-54.18] 0.74-0.86] 98.04-98.29] [97.49-97.92 2.29-2.78 
Spy 94a 0.71 93.89 16.52 51.45 0.80 97.83 97.62 3.06 
PY 0.66-0.77 93.65-94.12] 5.90-17.17 50.70-52.21] [0.73-0.87] 97.66-97.99] [97.35-97.87 2.75-3.39 
Vindija 87 0.36 97.08 6.81 75.93 0.37 99.05 98.91 1.38 
J 0.33-0.38 96.97-97.19 6.53-7.09] 75.49-76.36] 0.34-0.40] 98.98-99.12] [98.79-99.02 1.24-1.53 
Wissen 0.65 92.78 17.91 44.65 0.83 97.74 97.53 3.38 
z y 0.62-0.69 92.62-92.93] 7.51-18.31 44.19-45.12] [0.79-0.88] 97.63-97.84] [97.36-97.68 3.18-3.59 
Mezinaiskava1 0.76 91.87 20.30 38.38 0.97 97.61 96.80 3.73 
Z y 0.72-0.80 91.67-92.06 19.81-20.80] 37.85-38.91] 0.91-1.03] 97.48-97.73] [96.58-97.00 3.49-4.00 
Vindiia 33.16 3.60 93.39 17.09 58.99 3.94 96.14 95.46 4.89 
J . [3.50-3.71] 93.18-93.59] 6.55-17.65 58.36-59.63] [3.81-4.08] 95.95-96.32] [95.15-95.74 4.56-5.24 
Vindija 33.25 3.04 94.17 16.69 60.81 3.41 96.53 95.79 3.77 
J . 2.94-3.15 93.95-94.37 16.10-17.30] 60.11-61.49] 3.28-3.55] 96.33-96.72] [95.46-96.09 3.46-4.12 
Vindija 33.26 3.49 93.54 17.04 59.65 3.70 96.25 95.40 4.56 
J E 3.38-3.62 93.31-93.77] 6.43-17.67 58.94-60.36] [3.55-3.85] 96.04-96.45] [95.04-95.72 4.21-4.95 
Feldhofer 1 4.71 94.38 20 50.00 1.80 90.63 96.00 0 
[2.40-9.01] 87.51-97.58 [8.86-39.13] 34.07-65.93] 0.50-6.33] 81.02-95.63] [80.46-99.29 [0-12.87] 
El Sidron 1253 1029) 90.99 17.50 42.86 4.35 95.24 97.44 2.33 
0.44-3.72 84.21-95.03] [8.75-31.95] 30.02-56.73] [2.22-8.34] 88.39-98.13] [86.82-99.55 [0.41-12.06] 
Denisova4d 2.63 2.31 2.13 2.04 71.43 96.15 10.00 90.32 
1.51-4.54 [1.06-4.94] [0.59-7.43] [0.56-7.14] [65.87-76.40] 91.86-98.23] [5.15-18.51] [80.45-95.49] 
Deni 8 1.60 6.75 ews 1.83 60.02 92.53 15.32 87.72 
ee [1.39-1.85] _ [6.13-7.43] [1.23-2.48] [1.37-2.45] [58.92-61.12] 91.66-93.32] [13.72-17.06 [86.01-89.25] 
The state of DNA fragments overlapping the positions at which the high-coverage genomes of the Altai Neanderthal, the Vindija 33.19 Neanderthal, the Denisovan individual and a present-day African 
(Mbuti, HGDP00982) differ from those of the great apes were investigated. Fragments longer than 35 bp with MQ > 25 and within the highly mappable regions of the genome that had terminal C-to-1 
substitutions reported in the Supplementary Table 3.2 were used. 95% binomial confidence intervals are provided in brackets. 
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Extended Data Table 4 | Time of separation of late Neanderthals and Mezmaiskaya 1 (A) from the ancestor with the high-coverage genomes 
of Altai and Vindija 33.19 Neanderthals, Denisovan individual and a present-day human (B), when measured in terms of time of split from 
the B individual (split A-B), or time from present (split-time + branch shortening) 


All fragments Deaminated fragments 
i B % Split A-B Split time 95% CI %  SplitA-B Splittime 95% CI 
F(A|B) (ky) + bs (kya) (kya) |F(AJB) (ky) __+ bs (kya) (kya) 

era Altai 35.2 35.0 157.5 144.6-160.7| 34.7 37.8 160.2 156.4-163.5 
Goyet Q56-1 Altai 458 “223 144.7 142.8-157.6] 35.9 22.2 144.6 141.5-159.3 
Mezmaiskaya 2 Altai 35.4 33.8 156.2  143.9-159.7] 34.2 40.2 162.6  158.7-166.5 
Spy 94a Altai 34.4 39.2 161.7 158.2-165.2| 33.6 43.1 165.5 160.4-186.2 
Vindija 87 Altai 35.2 35.4 157.8 144.7-161.1] 34.6 38.3 160.8  156.2-164.8 
Mezmaiskaya | Altai 34.8 37.4 159.9 — 156.1-163.0] 34.4 42.9 165.3 161.1-178.6 
eee Vindija 33.19] 35.3. 15.1 66.9 66.1-68.7 | 34.3 18.3 70.0 67.9-72.4 
Goyet Q56-1_ Vindija 33.19] 37.8 8.3 60.1 58.8-61.7 | 38.0 8.1 59.9  57.8-61.9 
Mezmaiskaya2 Vindija 33.19] 35.6 14.3 66.1 64.3-67.7 | 343 18.3 70.0 67.4-74.5 
Spy 94a Vindija 33.19] 36.9 10.4 62.2 60.6-64.3 | 372 99 62.7  59.1-64.8 
Vindija87 —-Vindija 33.19] 46.8 0.0 51.8 51.8-51.8 | 45.0 0.0 61.7 51.8-51.8 
Mezmaiskaya | Vindija 33.19] 31.6 42.0 93.8 81.5-96.9 | 30.6 46.9 98.7 94.5-102.9 
ahi Denisovan | 12.6 333.1 405.1 381.9-448.2| 12.2 346.7. 418.7. 390.4-470.8 
Goyet Q56-1 Denisovan | 12.8 328.6 400.6 377.3-438.0] 13.1 322.4 3943 367.1-434.0 
Mezmaiskaya2 Denisovan | 12.4 338.2 410.2 387.0-458.4] 11.9 362.0 434.0 398.9-483.8 
Spy 94a Denisovan | 12.5 336.0 407.9  384.7-451.6] 124 3394 411.3 382.4-470.8 
Vindija87 -Denisovan | 12.5 337.1 409.1  385.8-455.0] 12.2 346.7 418.7 390.9-470.2 
Mezmaiskaya 1 Denisovan | 12.3 341.6 413.6  389.2-463.5] 123 345.0 417.0 388.7-471.4 
Sea Mbuti 17.5 525.9 525.9 511.3-539.8| 17.2 549.3 549.3 533.2-565.4 
Goyet Q56-1  Mbuti 17.7 515.0 515.0 501.8-528.9| 17.6 520.8 520.8  502.6-540.6 
Mezmaiskaya 2 Mbuti 17.6 521.6 521.6  506.9-536.2) 174 5354 535.4  517.9-553.0 
Spy 94a Mbuti 18.0 502.6 502.6  488.7-517.2) 17.3. 542.7. 542.7. 518.6-566.1 
Vindija 87 Mbuti 17.6 523.0 523.0 509.1-537.6| 17.3 542.0 542.0 524.5-558.8 
Mezmaiskaya | Mbuti 17.8 512.8 512.8  498.9-528.1] 17.1 552.2 552.2 534.7-569.8 


blocks of 5 Mb. In order to obtain estimates of sp’ 
+ branch shortening (ka)). 


Results reported for all fragments and for deaminated fragments only (Supplementary Tables 3.1, 3.2). 95% confidence intervals for F(A|B) values are estimated via weighted block jackknife, using 


it times from present, we correct ‘split A-B’ by adding the age of the high-coverage B-individual estimated from branch-shortening (column: split time 
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Moving magnetoencephalography towards 
real-world applications with a wearable system 


Elena Boto!*, Niall Holmes!*, James Leggett!*, Gillian Roberts, Vishal Shah’, Sofie S. Meyer**+, Leonardo Duque Mufioz’, 
Karen J. Mullinger!°, Tim M. Tierney’, Sven Bestmann**, Gareth R. Barnes’s, Richard Bowtell's & Matthew J. Brookes! 


Imaging human brain function with techniques such as 
magnetoencephalography’ typically requires a subject to perform 
tasks while their head remains still within a restrictive scanner. 
This artificial environment makes the technique inaccessible to 
many people, and limits the experimental questions that can be 
addressed. For example, it has been difficult to apply neuroimaging 
to investigation of the neural substrates of cognitive development 
in babies and children, or to study processes in adults that require 
unconstrained head movement (such as spatial navigation). Here 
we describe a magnetoencephalography system that can be worn 
like a helmet, allowing free and natural movement during scanning. 
This is possible owing to the integration of quantum sensors”, 
which do not rely on superconducting technology, with a system 
for nulling background magnetic fields. We demonstrate human 
electrophysiological measurement at millisecond resolution 
while subjects make natural movements, including head nodding, 


Photo-detector 


Rubidium 
atoms 


Laser beam 
Intensity 


Figure 1 | A new generation of MEG system. a, A conventional 
275-channel cryogenic MEG system. Weighing about 450 kg, the system 

is fixed and cumbersome and subjects must remain still relative to the 
fixed sensor array. b, Schematic illustration of zero-field resonance in an 
OPM sensor. Top, operation in zero-field; bottom, Larmor precession 
when an external field (B-field) impinges on the cell and the transmitted 
light intensity is reduced. c, A commercial OPM sensor made by QuSpin. 
The geometry used is illustrated by the coloured axes where B, is the radial 


stretching, drinking and playing a ball game. Our results compare 
well to those of the current state-of-the-art, even when subjects make 
large head movements. The system opens up new possibilities for 
scanning any subject or patient group, with myriad applications 
such as characterization of the neurodevelopmental connectome, 
imaging subjects moving naturally in a virtual environment and 
investigating the pathophysiology of movement disorders. 
Magnetoencephalography! (MEG) allows direct imaging of human 
brain electrophysiology by measurement of magnetic fields gener- 
ated at the scalp by neural currents. Mathematical analysis of those 
fields enables the generation of 3D images that show the formation 
and dissolution of brain networks in real time. MEG measurements 
of brain activity are currently made using an array of superconducting 
sensors placed around the head’. These cryogenically cooled sensors 
have femtotesla (fT) sensitivity, which is needed to detect the weak 
magnetic fields produced by the brain. Unfortunately, the requirement 


field component, B; the tangential field component and B, the direction 
along which the laser beam is oriented. d, Our prototype OPM-MEG 
system helmet. The helmet weighs 905 g and is customized so that the 
sensors (which in this prototype cover only the right sensorimotor cortex) 
are directly adjacent to the scalp surface. The subject is free to move their 
head. The measured radial field direction for the sensors is illustrated by 
the red arrows. 
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Birmingham B15 2TT, UK. °Sobell Department for Motor Neuroscience and Movement Disorders, UCL Institute of Neurology, University College London, Queen Square House, Queen Square, 
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Figure 2 | OPM-MEG results. a, Results when the subject was asked to 
remain still. b, Results when the subject was moving. c, Data collected 
using a cryogenic MEG instrument (SQUIDs) for comparison. In each 
row, the left panel contains a histogram showing movement of three 
fiducial markers on the subject’s head. The inset graph shows the change 
in marker positions over a typical experiment; different colours show 
movement in three Cartesian axes of the three markers. The middle panel 
shows the change in beta band power due to finger abduction (pink) 
overlaid onto axial, sagittal and coronal slices of the anatomical MRI. The 
functionally active region overlays contralateral sensorimotor cortex. 

In the right panel, a time-frequency spectrogram depicts changes in 


for cooling means that sensors must be housed within a liquid helium 
dewar with a vacuum space separating sensors from the scalp. MEG 
systems are therefore cumbersome (Fig. 1a) and sensor positions are 
fixed in a one-size-fits-all helmet. Any motion of the head relative to 
the sensors reduces data quality markedly: even a 5-mm movement can 
prevent use of the data>. Further, the brain-to-sensor distance, which 
is substantial in adults (about 3 cm), is increased markedly in subjects 
with small heads, reducing the available signal because the magnetic 
field decreases with the square of the distance between the source and 
the sensor. These characteristics make participation in MEG studies 
challenging for many subject groups, including infants and many 
patients. They also make the MEG scanner environment unnatural, 
and limit the experimental paradigms that can be studied. Here, we 
describe a transformative MEG technology that can be worn on the 
head during movement. This opens up the possibility for non-invasive 
mapping of human electrophysiology across all ages and patient groups, 
with subjects who are free to move and interact with the real world. 
At the core of our system is an array of optically pumped magneto- 
meters (OPMs)—magnetic field sensors that rely on the atomic 
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neural oscillations during finger abduction. The inset graph shows the 
characteristic beta band response for finger abduction (blue) and rest 
trials (red). In all cases the results are averaged over trials and experiments 
and the shaded region shows s.e. over six experiments. SIR ranged from 
4.3 to 8.2 for static OPM measures, 4.2 to 5.8 for moving OPMs and 4.9 

to 7.9 for the cryogenic system. Further analysis (Extended Data Fig. 2 
and Supplementary Information section 2) showed that the OPM system 
outperformed the cryogenic system in terms of both spatial resolution 
and robustness across experiments. Temporal resolution was quantified at 
130 Hz. 


properties of alkali metals. These sensors have undergone marked 
development in recent years***° and are well suited to MEG meas- 
urements!-!® In our system, each sensor contains a 3 x 3 x 3-mm? 
glass cell containing *’Rb vapour, which is heated to approximately 
150°C. A 795-nm circularly polarized laser beam, tuned to the D1 
transition of rubidium, is used to spin-polarize the atoms, and the 
intensity of laser light transmitted through the cell is detected using 
a photodiode. In zero magnetic field, the spin magnetic moments 
align with the beam, and transmission of laser light to the photodiode 
is maximized. However, a magnetic field perpendicular to the beam 
causes Larmor precession, rotating the magnetic moments away from 
alignment. This causes a measurable drop in light transmission. The 
resulting effect is a zero-field resonance (Fig. 1b), which acts as a 
sensitive magnetic field indicator. 

Each sensor is an integrated unit (Fig. 1c) with a noise level compara- 
ble to that of a superconducting quantum interference device (SQUID; 
about 15fT per V Hz) and a dynamic range of +1.5nT. Although the cell 
is heated, sensors can be mounted on the scalp because their external sur- 
faces remain close to body temperature. Our prototype system (Fig. 1d) 
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Figure 3 | Biplanar fingerprint-coil system for removing remnant 
static magnetic fields. a, Schematic of the coils, which are confined to 
two planes surrounding a 40 x 40 x 40-cm? region of interest in which 
the head is allowed to move. The five separate layers represent wire paths 
that generate fields B,, By, Bz, dB,/ dz and dB,/dz. b, Bar chart showing 
field magnitude with and without field nulling. Inset images show spatial 
field variation of B,. The static field was reduced from 23 + 1 nT to 

1.6 + 1.0nT. dB,/dz measured across a 10-cm region spanning the width 


comprised an array of sensors that were mounted in a 3D-printed 
‘scanner-cast. The scanner-cast’? was designed using an anatomical 
magnetic resonance imaging (MRI) scan, such that the internal surface 
snugly fits the subject’s head, while the external surface accommodates 
the OPMs, which were positioned over the right sensorimotor cortex. 
Four additional reference sensors, sited near the subject’s head, were 
used to measure background interference. 

As a first demonstration, we measured electrophysiological activity 
in the right sensorimotor cortex during visually cued abduction of the 
left index finger. This task robustly elicits a reduction in endogenous 
beta band (13-30 Hz) oscillations during movement and a rebound 
(increase above baseline) following movement cessation!’. Although 
simple, this task has been applied widely, with beta modulation being 
used as a marker of brain plasticity'*!”, psychosis”*”! and white matter 
degradation. A single experiment comprised 50 trials, each involving 
1s of finger abduction and 3s of rest. Fifty dummy’ trials (during 
which the subject did nothing) were interleaved to allow an estimate 
of baseline activity. A subject undertook the experiment twelve times: 
six during which they kept as still as possible, and six during which 
they made natural head movements, including nodding and shaking, 
stretching and drinking. Head motion was measured using an Xbox 
Kinect camera, which tracked the movement of three fiducial markers 
on the head. We also undertook the same static set of experiments using 
a cryogenic MEG system. 

Figure 2 shows OPM-MEG data measured when the subject kept 
still (Fig. 2a) and moved (Fig. 2b). Subject motion is shown in the left- 
hand column. In the static case, motion was less than --1 cm, whereas 
in the moving case it exceeded +10 cm. The middle column shows 
images” of beta modulation (pink overlay) and the right-hand 
column shows a time-frequency spectrogram (TFS) of oscillatory 
change. In the TFS, blue indicates a decrease in oscillatory amplitude 
relative to baseline whereas red indicates an increase. Line plots of beta 
amplitude are shown in the inset. Equivalent data from the cryogenic 
system are shown in Fig. 2c, where movement was (necessarily) con- 
strained to less than 2mm. OPM-MEG performed consistently across 
experiments with the characteristic beta decrease and post movement 
rebound clearly delineated and localized to the sensorimotor cortex. 
Despite an order of magnitude increase in head movement, there was 
no significant difference in signal-to-interference ratio (SIR) between 
the moving and static runs (P=0.39; two-sided Wilcoxon sum-rank 
test) and no correlation between the degree of movement and response 
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of the head (green box in b) was reduced from 10nT m“! to 0.28nT m!. 


c, Top, output of 13 OPMs over time while the subject nods their head 
(bottom, head movement). Blue and red traces show the cases where the 
field nulling system was on and off, respectively. Note that without field 
nulling the OPMs saturate during head movement, whereas with nulling 
the sensors continue to work, leaving an artefact that is comparable in 
magnitude to that due to an eye-blink. 


size (Extended Data Fig. 1 and Supplementary Information section 1). 
The spatial resolution of the OPM system was better than that of the 
cryogenic system, even with only 13 sensors (Extended Data Fig. 2 and 
Supplementary Information section 2). These data, along with a similar 
analysis of evoked responses (Extended Data Fig. 3 and Supplementary 
Information section 3), show clearly that the wearable system can be 
used to collect high-fidelity data even in the presence of large head 
movements. 

Critical to the wearable MEG system is a means to null the back- 
ground static magnetic field impinging on the OPMs. The system is 
housed inside a magnetically shielded room (MSR). However, the 
remnant Earth’s field in the MSR is about 25 nT, and it is spatially 
inhomogeneous. Any sensor movement through this field during an 
MEG recording would result in a field change much larger than the fields 
of interest, and would exceed the narrow (£1.5nT) operational range of 
the OPMs, rendering them inoperable. In addition, such changes can 
modulate the sensor gain (Extended Data Fig. 4 and Supplementary 
Information section 4). To ameliorate this problem, we constructed a set 
of bi-planar electromagnetic coils designed to generate fields equal and 
opposite to the remnant Earth's field, thereby cancelling it out. 

The coils were designed”>”® on two 1.6 x 1.6-m? planes, placed either 
side of the subject with a 1.5-m separation (Fig. 3a). Three coils gener- 
ated spatially uniform fields (B,, B, and B,) while two additional coils 
were used to remove the dominant field variations (dB,/dz and dB,/dz). 
In this way, unlike standard field-nulling technologies (for example, 
tri-axial Helmholtz coils), our system can account for spatial varia- 
tion of the field over a 40 x 40 x 40-cm? volume of interest enclosing 
the head. Furthermore, we were able to cancel all components of the 
field vector using coils confined to just two planes, hence retaining 
easy access to the subject. Four reference OPM sensors were coupled 
to the coils in a feedback loop to null the residual static field in the 
volume of interest. We achieved a 15-fold reduction in the remnant 
Earth’s field and a 35-fold reduction in the dominant field gradient 
(Fig. 3b). Figure 3c shows OPM measurements made during 7-cm head 
movements, with (blue) and without (red) field nulling. The results 
show that without field nulling, the OPM sensors failed (evidenced 
by the saturation). With field nulling, however, the OPMs were able to 
capture MEG data even while the head was moving (Supplementary 
Information section 5). 

The ability to map human electrophysiology non-invasively, with 
whole brain coverage and high spatiotemporal resolution is, to our 
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a_ Ball game paradigm 
7 


Figure 4 | An example ‘real world’ imaging paradigm. a, Experimental 
set-up: the seated subject continually bounced a table tennis ball off a bat 
for 10s. This was followed by a 10-s baseline period, during which they 
did nothing. This was repeated 29 times. b, Spatial signature of beta band 
oscillatory change during periods of playing the ball game compared to 
rest. Note the difference in localization compared to Fig. 2, with the beta 


knowledge, unique to MEG”. Alternative techniques either lack spa- 
tial specificity (for example, electroencephalography (EEG), which is 
also more susceptible to muscle artefacts (see Extended Data Fig. 5 
and Supplementary Information Section 6)) or provide only indirect 
access to brain function via metabolic processes (for example, func- 
tional magnetic resonance imaging (fMRI)). However, current MEG 
systems exclude many subject cohorts and experimental paradigms. 
Our system represents a step change for functional imaging. A wear- 
able instrument with scalp-mounted sensors means that subjects 
can be scanned at all ages, from babies to elderly patients, allowing 
imaging during key stages of cognitive development and decline. For 
example, it is well known that efficient communication between spatially 
separate cortical regions is key to healthy brain function, and that 
neural oscillations help to mediate such connectivity*”. However, little 
is known about how electrophysiological brain networks emerge during 
the early years of life. Our system can characterize those networks, and 
the spectro-temporal profile of the connectivities that underlie them 
(see Extended Data Fig. 6 and Supplementary Information section 7). 
This highlights the potential utility of OPM-MEG for characterizing 
the developmental connectome. 

A wearable system also opens doors to myriad neuroscientific 
investigations in which subjects can move naturally and interact 
with the real world. An example is given in Fig. 4, which shows brain 
activity elicited when the subject played a ball game (Fig. 4a). This 
naturalistic paradigm requires free, rapid and unpredictable head and 
arm movement; nevertheless, we were able to localize changes in beta 
oscillations to the arm and wrist area of motor cortex (Fig. 4b) and track 
the dynamics of oscillatory modulation (Fig. 4c). To evaluate the robust- 
ness of this measure, similar data were captured in two further subjects. 
Task-induced beta modulation, relative to baseline, was measured at 
61%, 80% and 70% in the three participants. This simple example 
demonstrates the power of wearable neuroimaging to accurately assess 
brain function during a real world visuomotor coordination task, 
adding a new dimension to previous work**. This example could readily 
be extended to examine, for example, the neural correlates of motor 
coordination, their maturation during neurodevelopment and their 
breakdown in movement disorders. Such studies using naturalistic 
stimuli are inaccessible to conventional neuroimaging scanners owing 
to the movement required to undertake the task. In sum, this technology 
has transformative potential across a range of neuroscientific and clinical 
applications where knowledge of brain electrophysiology is informative. 
Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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modulation localized to the arm and wrist area of sensorimotor cortex 
(distinct from the hand area in Fig. 2). c, Trial averaged time-frequency 
spectrogram (left) and amplitude of beta oscillations averaged over trials 
(right). The maximum head movement during this paradigm, assessed by 
the Kinect camera, was 6cm. 
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METHODS 


OPM-MEG system design and fabrication. System overview. An overview of our 
OPM-MEG system is shown in Extended Data Fig. 7a. The system comprised 
13 OPM sensors (QuSpin), which were mounted on the scalp surface (the scalp 
array), and a further 4 sensors (placed close to the head, but not on the scalp 
surface) which were used to make reference measurements (the reference array). 
Each sensor produces an analogue voltage output that is proportional to the 
magnetic field perpendicular to the laser beam (scaling=2.7 V nT~!). The sensor 
outputs were digitized at a sample rate of 1,000 Hz, using a 16-bit digital acquisition 
(DAQ) system (National Instruments) controlled using custom-written software 
in LabVIEW. The sensor arrays were housed inside a magnetically shielded room 
(MSR) to reduce environmental magnetic interference; all control equipment was 
kept outside the room to minimize its effect on MEG measurements. 

Prior to data acquisition, the reference array was used to identify the currents in 
the coils that best nulled the residual static magnetic field inside the MSR and its 
dominant first-order spatial variation. Reference sensors were located and oriented 
such that the three Cartesian components of the magnetic field (B,, B, and Bz), and 
the two dominant field gradients (dB,/dz and dB,/dz) could be measured close to 
the head (Extended Data Fig. 7b). Reference array measurements were input to a 
proportional integral derivative (PID) algorithm, which was used to control the 
currents in the field-nulling coils. This allows the calculation of currents which 
generate fields that are equal and opposite to those measured by the reference array, 
thus attenuating the static field over the volume spanning the subject’s head. This 
step is key if the head is free to move during MEG data acquisition, as without 
field nulling, even small changes in head position or orientation (for example, a 4° 
rotation in a 25-nT field) would produce field variations large enough to render 
the OPMs inoperable (Fig. 3c). 

During data acquisition, all 17 sensors (13 sensor scalp array and 4 sensor 
reference array) were operated simultaneously, with the reference sensors (which 
are sufficiently far from the scalp to be insensitive to the neuromagnetic field) used 
to characterize temporally varying background magnetic interference, which was 
later regressed from the signals recorded by the scalp-mounted sensors. 

The visual cue for paradigm control was controlled by a separate stimulus 
computer. This was coupled to a data projector that projected the cue image 
through a waveguide onto a back-projection screen positioned ~40 cm in front 
of the subject. The stimulus computer also generated TTL trigger pulses of 1-s 
duration denoting the onset of the visual cues and the start of the rest periods. 
These trigger signals were input to the DAQ and sampled, simultaneously with 
the OPM data, at 1,000 Hz. Throughout data acquisition, an Xbox Kinect camera 
(Microsoft) was used to measure head movement. Video data were captured using 
a third computer, which also recorded the trigger channels so that movement data 
could later be analysed on a trial-by-trial basis. 

OPM sensors. The fundamental building block of our system is the OPM sensor. 
We used compact commercially available sensors*!! (Fig. 1c), allowing a large 
number to be located flexibly on the scalp surface. Each OPM sensor head contains 
a semiconductor laser for optical pumping, optics for laser beam conditioning, a 
3 x 3 x 3-mm?*’Rb vapour cell and a silicon photodiode for beam detection. The 
sensor head connects to a small electronics controller via a 5-m cable that is passed 
through a waveguide in the MSR. The sensor includes three on-board coils, which 
can be used to null static field components in the cell. Field changes (for example, 
due to neural currents in the brain) can be detected via the change in transmitted 
light intensity that they produce (Fig. 1b). The transmitted intensity shows a zero- 
field resonance, which is a Lorentzian function of the magnetic field components 
transverse to the laser beam, with a full width at half maximum of around 30 nT. 
For continuous field measurements, a sinusoidally modulated magnetic field of 
1-kHz frequency is applied perpendicular to the laser beam using the on-sensor 
coils. The depth of modulation of the transmitted light, which is monitored using 
a lock-in process, is sensitive to the magnitude of the field component along the 
modulation axis”*. The amplitude of the two field components perpendicular to 
the beam can be measured simultaneously by applying oscillating currents to two 
coils in quadrature. However, here only the radial field component was measured. 
Scanner-cast design and 3D printing. Pre-recorded MRI and cryogenic MEG data 
were used to inform the design and construction of the individualized scanner-cast. 
To ascertain the shape of the subject’s head, an anatomical MRI scan was acquired 
using a Philips 3T Ingenia MR system running a T-weighted gradient echo 
imaging sequence, with an isotropic resolution of 1mm and a high bandwidth 
to limit field-inhomogeneity-related distortion. A 3D mesh, representing the 
outer surface of the head and face, was extracted from this image and then used to 
define a custom-fitted helmet, in which the OPM sensors could be mounted!2. In 
a complete system, a large number of sensors would be spaced equidistantly across 
the entire scalp surface, giving whole brain coverage, but in our prototype system 
(used for the finger abduction and ball game paradigms), only 13 sensors were 
available and these had to be positioned for optimal coverage of the sensorimotor 


region. To determine the optimal OPM sensor positions, we carried out the visually 
cued finger abduction experiment on the same subject using a cryogenic MEG 
system, and localized the source position and orientation using a beamformer. 
Having computed the location and orientation of the dipolar source, we used a 
forward model to compute the radial magnetic fields at the scalp surface generated 
by this dipole. The positions of the scalp-array sensors were then chosen to sample 
these fields optimally'?. We assessed the effect of crosstalk between sensors, which 
may occur as a result of constructive interference of fields from adjacent sensors. 
For the sensor array used here we found these effects to be less than 3% when 
taking into account the radial field (Extended Data Fig. 8 and Supplementary 
Information section 8). See Supplementary Information section 7 for details of 
the scanner-cast that accommodated the 26 sensors that were used in a functional 
connectivity demonstration (Extended Data Fig. 6). 

Field-nulling coils. The remnant Earth’s field at the centre of our MSR is ~25 nT, 
and also shows significant spatial variation, with a gradient of ~10nT m™!. This 
means that translation through the inhomogeneous remnant field, or rotation with 
respect to the field vector, produces field changes at the sensors that are much larger 
than the neuromagnetic fields of interest (Fig. 3). Given the size of the remnant 
field and the narrow operational range of the OPMs (+1.5 nT), such movement is 
likely to take the sensors outside their operational range. For example, a rotation 
of less than 4° in a field of 25 nT can produce a change of more than 1.5 nT in 
the amplitude of the magnetic field along an OPM sensor's sensitive axis, thus 
rendering it inoperable. The use of the field-nulling coils to reduce the remnant 
static (Earth’s) field over a large central volume inside the MSR is therefore critical 
if the sensor array is to be operated without being rigidly fixed in position with 
respect to the MSR. 

Here, the remnant field was nulled using a set of coils positioned on either 
side of the subject (see Fig. 3a). Five different bi-planar coils were constructed 
independently to null B,, B, and B, and the two dominant field gradients dB,/dz 
and dB,/dz. In contrast to the tri-axial Helmholtz coil arrangement that is 
commonly used to null the remnant field inside an MSR, the bi-planar coil array 
allows cancellation of spatially varying fields and also does not significantly limit 
access to the subject, since the coil windings are confined to two vertical planes 
(rather than the three pairs of orthogonal planes that would enclose the subject 
when using three orthogonal Helmholtz coils). The bi-planar coil system therefore 
offers considerable advantages by eliminating the spatially uniform remnant field 
and its first-order spatial variation, while maintaining access to the subject. 

Magnetic fields from bi-planar coils*! were optimized for homogeneity using a 
harmonic minimization approach”. The current distribution J was confined to the 
two planes at z= +a in the region |x|,|y| <_L, and defined using the stream function 
Sas J=VS x 2. Sis parameterized as a two-dimensional Fourier series so that 


cos(n — —)1x 
s= > Qn ( + 6, sin x 
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Optimal values of the Fourier coefficients (Q,, Bn, Ym dm) were identified by 
exploiting the symmetry of the field distribution and then minimizing: 


SS [B(r:) ~ Bearget(ts) + wP 


t 


Where r; characterizes locations within the volume of interest at which a homo- 
genous field or field gradient (described by Biarget) is required, and P represents the 
power dissipated in the coil. The coefficient w can be used to adjust the weighting 
of the power term. Increasing w reduces the complexity of the wire paths to be 
fabricated. B(r;) was calculated using the relationship 


~ a. ap a7sinh cosh g- 
B- Ho like + ik yp) 2) - k ack) kya 


where B and § are the two-dimensional Fourier transforms of the field and stream 
function with respect to x(k,) and y(ky), andk,= (k: + 5). The upper and lower 


sinh and cosh terms correspond to the case of the stream function having the same 
or opposite sign on the two planes. Coils were designed to produce a homogeneous 
(within -£5%) field or field gradient over a 0.4 x 0.4 x 0.4-m? central region, which 
is large enough to span the range of sensor positions during experiments in which 
head movement is allowed. The wire paths for each coil span an area of 1.6 x 1.6m? 
and are layered on two planes of medium-density fibreboard separated by 1.5m. 
The coil wire paths and contours of the magnetic field or field gradient in a central 
plane between the two coils are shown in Extended Data Fig. 9 for each coil. The 
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field variation was calculated by applying the elemental Biot-Savart law to the 
digitized wirepaths (see Supplementary Information section 5 for further 
discussion). 

Motion detection and quantification. To quantify head movement, we used a simple 
optical tracking system based upon a Microsoft Kinect V1-2010 camera which was 
placed ~1 m in front of the subject. This camera provides a simultaneous 8-bit 
RGB video stream (640 x 480 pixel display, with 57° horizontal and 43° vertical 
field of view) and an 11-bit depth image, reconstructed from an infrared projected 
field. Data from both streams were captured using the MATLAB image acquisition 
toolbox, at 12 frames per second. 

A motion-tracking algorithm was used to track the positions of three black 
dots on the outer surface of the white scanner-cast. The algorithm was initiated 
via manual selection of the three dots in the first frame of the video. A threshold 
detect routine then identified the dots and their centres of mass in all subsequent 
frames. Pixel numbers were converted to absolute locations in three dimensions by 
integrating the video data with the depth field. In this way, we defined parameters 
describing the motion of the three markers throughout the experiment. To quan- 
tify motion, movement parameters for all three orientations and all three fiducial 
locations were concatenated. Figure 2 (left) shows these data as a histogram plotted 
on a logarithmic scale. 

Experimental method. Finger abduction paradigm. Experiments were carried out 
ona single subject (female, right-handed, age 27), who provided written informed 
consent (both to participation in the experiments and to release of photographs). 
The study was approved by the University of Nottingham Medical School ethics 
committee. The subject undertook multiple repeats of a visuomotor task”’. The 
paradigm comprised visual stimulation with a centrally presented, maximum- 
contrast, vertical square-wave grating (3 cycles per degree). The grating sub- 
tended a visual angle of 8° and was displayed along with a fixation cross on a grey 
background. In a single trial, the grating was presented for 1s followed by a 
3-s baseline period during which only the fixation cross was shown. During 
presentation, the participant was instructed to make repeated abductions of 
their left (non-dominant) index finger. Fifty of these trials were recorded in 
each experiment. Blocks of ten trials were interspersed with blocks of ten ‘rest’ 
trials. In these rest trials (also 4s in duration) the fixation cross remained on the 
screen, but no finger abduction was made. Averaging across the 50 ‘real’ and 50 
‘rest’ trials independently allowed assessment of SIR (see below). This experiment 
was repeated six times with the subject stationary and six times with the subject 
making natural movements. It was also repeated six times using a cryogenic MEG 
system. 

The ‘ball game’ paradigm. The subject was seated in the OPM-MEG system 
and asked to play a simple ball game in which they continually bounced a table 
tennis ball on a bat. Ten-second bursts of ‘ping-pong’ were interspersed with 10s 
of baseline activity (during which the subject was told to simply hold the bat and 
ball on their knee). This gave a total trial length of 20s, and the subject repeated 
this 29 times. Owing to the location of the OPM sensors in the scanner-cast, the 
experiment was undertaken with the subject’s non-dominant hand. Trials were 
cued by a second experimenter who was positioned inside the MSR throughout 
the experiment and gave verbal instructions to cue the game (they shouted either 
‘ping-pong’ or ‘rest’ to begin or stop the game). This experiment was undertaken 
once in the OPM-MEG system by three subjects. 

Cryogenic MEG system data acquisition. To compare OPM-MEG to cryogenic 
MEG recordings, we carried out the finger abduction experiment, in the same 
subject, using a 275-channel CTF MEG system (MISL) operating in third- 
order synthetic gradiometer configuration**. Data were acquired at a sampling 
frequency of 600 Hz and the subject was seated. Three electromagnetic position 
indicator coils were placed on the head as fiducial markers (at the nasion, left 
preauricular and right preauricular points). The locations of these fiducials were 
tracked continuously during the recording by sequentially energizing each coil 
and performing a magnetic dipole fit to these data. This allowed both continuous 
assessment of head movement throughout the measurement, and knowledge of 
the location of the head relative to the MEG sensors. Prior to the MEG recording, 
a 3D digitization of the subject’s head-shape, relative to the fiducial markers, was 
acquired using a 3D digitizer (Polhemus). Co-registration of the MEG sensor 
geometry to the anatomical MR image was achieved by fitting the digitized head 
surface to the equivalent head surface extracted from the anatomical image. The 
subject undertook the experiment six times and a different head digitization was 
acquired each time. 

Data processing: interference rejection. Following data collection, OPM-MEG 
data were processed to remove magnetic interference. The reference array 
sensors were located close enough to the scalp array to capture similar environ- 
mental interference, but sufficiently far away to be insensitive to the neuromagnetic 
fields of interest. This meant that the scalp and reference arrays could be used to 
synthesize a ‘gradiometer’ measurement whereby the scalp array data are de-noised 
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via regression of the reference array signals’”. The reference array data from all four 
channels were combined in a single (design) matrix and a regression was used to 
remove any linear combination of reference array (interference) signals from the 
scalp array (neuromagnetic) signals. We applied this correction to the raw (that 
is, unfiltered) data. 
Data processing: beamforming. Following interference rejection, all MEG data 
(OPM and cryogenic) were processed in the same way. Data were initially inspected 
visually, and any trials with excessive interference were discarded. This resulted 
in the loss of only one trial (from a cryogenic recording). A beamformer adaptive 
spatial filtering approach was then employed. 

Using a beamformer”*"?, an estimate of electrical source strength Qg(t) made 
at time t and a predetermined location in the brain is given by a weighted sum of 
sensor measurements such that 


Qo(t) = Wom(t) (1) 


Here m(t) is a vector of magnetic field measurements made at time f across all 
sensors (that is, 13 in our OPM system or 275 in the cryogenic system) and Wo is 
a vector of weighting parameters tuned to a predefined location in source-space 
and current dipole orientation, represented by the vector 8. The superscript T 
indicates a matrix transpose. 

The weighting parameters are derived on the basis of power minimization. The 
overall power in the output signal Qg(t) is minimized with the linear constraint 
that power originating from the location and orientation of interest (8) should 
remain. Mathematically, the beamformer problem can be written as: 


42 

nutes 

where 6, represents the source power, given by 6, = wac Wo, C represents the 

channel level data covariance matrix calculated over a time-frequency window of 

interest, and Lg is the lead field vector, which is a vector containing a model of the 

magnetic fields that would be measured at each of the sensors in response to a 

source of unit amplitude with location and orientation @. The (regularized) solu- 
tion to equation (2) is found analytically to be: 


subject to WeLo=1 (2) 


Wo= [LC +p} Le] LiC+ SZ} (3) 


» is a diagonal matrix representing the white noise at each of the MEG channels 
(which we approximate as the identity matrix) and ju is a regularization 
parameter™. Note that, in addition to source localization, the power minimiza- 
tion term has the desirable effect of reducing artefacts from, for example, muscles 
(Extended Data Fig. 10 and Supplementary Information section 9). 

We sought to examine beta band effects, and so the beamformer weights were 
constructed, using equation (3), with covariance matrix C computed using beta 
band (13-30 Hz) data over a time window spanning the entire experiment” (400s). 
In all cases (OPM and cryogenic MEG data) the regularization parameter was 
optimized to give the best SIR (defined as the standard deviation of the finger 
abduction trials divided by the standard deviation of the rest trials). The lead field 
was calculated using the analytical formulation first described by Sarvas**. Two 
other covariance matrices were constructed: C, was defined as the data covariance 
in an ‘active’ window. This spanned the 0s < t< 1s window (relative to trial onset) 
in the case of the finger abduction paradigm, and the 1s <t< 9s window (relative 
to trial onset) in the ball game paradigm. C, represented data covariance in a ‘con- 
tro? window (1s <t<2s for the finger abduction and 11s <t< 19s for the ball 
game). We then defined a pseudo-T-statistical contrast as 


_ WoCaWo — WaC-Wo 
2WaCcWo 


To 


Pseudo-T-statistics were computed at the vertices of a regular 4-mm grid spanning 
the entire brain. For each voxel, the orientation of each source was based on a 
non-linear search for the orientation that gave the maximum signal-to-noise 
ratio*’, This method allowed construction of 3D images showing the spatial 
signature of maximum change in beta power. These images, averaged across 
experiments, are shown in Fig. 2 (middle) and in Fig. 4b. 

Finally, to investigate the spectrotemporal signature of electrophysiological 
activity at the location of peak change, a time-frequency spectrogram (TFS) was 
derived. Here, the peak location/orientation, Opeak, was extracted from the beam- 
former images, and a time course of electrophysiological activity for that location 
derived using equation (1) (the data covariance for the weights calculation was 
expanded to a broad (1-150 Hz) frequency range and covered the full 400s of 
data collection). The resulting data were frequency filtered into 31 overlapping 
frequency bands, and a Hilbert transform was used to generate the amplitude 
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envelope of oscillations within each band. These envelope time courses were 
then averaged across trials, and experiments, and concatenated in the frequency 
dimension to generate a single TFS (shown in Fig. 2 (right) and 4c). The same 
method was used (with beta band filtered covariance for weights calculation) to 
examine beta band fluctuations. 

Code availability. The MATLAB code used to analyse the current study is available 
from the corresponding author on request. 

Data availability. The datasets generated during and/or analysed during the 
current study are available from the corresponding author on request. 
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Extended Data Figure 1 | Response magnitude as a function of subject 
movement. a, Beta envelopes for finger abduction trials (blue/red) and 
resting trials (black/green) in the presence of large movement (red/green) 
and small movements (blue/black). b, The response size (that is, the 
difference between the mean amplitude during the desynchronization and 
rebound periods) shown as a function of maximum movement during 
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a trial. Note that no measurable relationship was found. A significant 

(P= 0.0052, Pearson correlation) baseline shift was observed; this is likely 
to be a consequence of artefacts in the data generated by electrical activity 
in muscles controlling the naturalistic movements. See also Supplementary 
Information section 1. 
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Extended Data Figure 2 | Quantification of spatial and temporal 
resolution. a, A ‘seed’ location was selected in sensorimotor cortex (at 
the cross-hairs). Four thousand random ‘test’ locations (red squares), 
within 3 cm of the seed, were selected randomly and probed. Shared 
variance was measured between electrophysiological time courses at 

the seed and test locations. b, Left, correlation between the seed and test 
time courses plotted as a function of spatial separation. Right, source 
separation at which shared variance dropped to below 50%. In both cases 
the error bars show s.d. over test locations. This serves as an absolute 
quantification of spatial resolution. Note that the OPM array, when 
static, significantly outperforms SQUIDs (P= 0.002, Wilcoxon sum- 
rank test). c, Quantification of the robustness of the source orientation 
estimation. While source power can vary between experiments, source 
orientation relies only on the local orientation of the cortical sheet, and 
should therefore be the same across equivalent experiments. Here the 
histograms show the source orientation difference (as an angle) across 
runs for 4,000 locations of interest. Note for all three experiments 

(static OPMs (top), moving OPMs (middle) and SQUIDs (bottom)) the 
probability distribution peaks at zero as would be expected. The bar chart 
shows the probability of observing an angular discrepancy below 5°; note 


that the OPM array, when static, significantly (P< 0.05, permutation 

test) outperformed the SQUID array in terms of orientation robustness. 
Moving OPMs demonstrated the lowest robustness; however, this would 
be expected since the execution of natural movements differed across 
runs and therefore brain activity in the sensorimotor strip will also differ. 
The improvement in spatial specificity and robustness in our OPM- 

MEG system compared to a cryogenic (SQUID) system is likely to be a 
consequence of two factors: first, the closer proximity of the OPM sensors 
to the scalp provides higher signal-to-noise ratio in OPMs compared to 
SQUIDs; second, the scanner-cast ensures that, for each run, OPM sensors 
are in exactly the same location with respect to the brain. Cryogenic MEG, 
on the other hand, is subject to co-registration errors. d, Quantification 

of the OPM sensor’s frequency response, which also defines its temporal 
resolution. An OPM sensor was placed in a Helmholtz coil and a white 
noise current source applied to the coil. The blue line shows the Fourier 
spectrum of the current source, the green line shows the spectrum of 

the measured field. Note that sensitivity falls by 3 dB at 130 Hz, giving a 
temporal resolution of 7.7 ms. See also Supplementary Information 
section 2. 
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Extended Data Figure 3 | Evoked response analysis. a, Results when the trials in red. The shaded area shows standard error across six experiments. 
subject was asked to remain still. b, Results when the subject was moving. c, Direct comparison of the evoked response when the subject was asked 
Left panels show functional image: the overlay shows the spatial signature to remain still (red) and was moving (blue). No significant SIR difference 
of the 2-30 Hz component of the evoked response, overlaid onto axial, was found between static and moving runs (P= 0.24, two-sided Wilcoxon 
sagittal and coronal slices of the anatomical MRI. Right panels show the sum-rank test). See also Supplementary Information section 3. 


time course of the evoked response; finger abduction trials in blue, rest 
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Extended Data Figure 4 | Gain changes with static magnetic field. transforms of each data segment. The inset figure shows the height of the 
a, Raw OPM-MEG data recorded from a single channel during the gain 137-Hz peak for different segments. Note that the peak height changes as a 
experiment. Data were divided into nine segments (colour-coded here) _ function of static magnetic field. c, Fractional change in 6B, as a function 
corresponding to nine different static background magnetic fields (BS), of background field BS"), The blue circles show the measured data with 
ranging between —1.5nT and 1.5 nT. The inset plot shows the small the standard deviation over the six sensors. The green line shows a fitted 
oscillating field (5B,), applied (in this sensor) at 137 Hz using the radially Lorentzian function. See also Supplementary Information section 4. 


oriented on-sensor coil, which mimics neuromagnetic activity. b, Fourier 
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Extended Data Figure 5 | Comparison of EEG and OPM-MEG. 

a, Muscle tensing experiment. i, Channel montages for EEG (top) and 
OPM-MEG (bottom). Blue circles show EEG channels used; blue squares 
show MEG channels used; red stars denote channels used to create (ii); 
black circles indicate channels used for averages in (iii). ii, Time-frequency 
spectra showing fractional change in oscillatory amplitude, relative to 
baseline. The three plots show three separate channels, with the muscle 
artefact visible in the 0-1-s window, when jaw clenching took place. 

iii, Quantitative analysis of the magnitude of the artefact, which was 
measured to be about ten times larger in EEG. Error bars show s.d. 
across sensors. b, Finger abduction experiment. The four rows show 
OPM-MEG and EEG data with the subject stationary, followed by 
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OPM-MEG and EEG data with the subject making natural movements. 
The left-hand column shows movement parameters. The left and left- 
centre time-frequency spectra show absolute difference from baseline of 
the MEG (in fT) and EEG (in tV) signals for individual channels, in the 
finger abduction and resting trials, respectively. These results have been 
averaged across all six experiments in both modalities. The right and 
right-centre time-frequency spectra show equivalent visualizations for 

a single representative experiment. Notice that, with the head stationary, 
MEG and EEG show similar results. However with the head moving, EEG 
data suffer from artefacts generated by muscle activity, to which the MEG 
data are less susceptible. See also Supplementary Information section 6. 
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Extended Data Figure 6 | OPM-MEG derived functional connectivity. 
A single subject took part in an experiment in which 5 min of OPM- 
MEG data were acquired in the resting state (subject was told to ‘think of 
nothing’). The experiment was repeated twice and the results averaged. 
a, A 26-channel OPM scalp array, with OPM sensors positioned (using 
a scanner-cast) approximately to cover the left and right parietal lobes 
(red circles). MEG data were reconstructed in source space using a 
beamformer, on a 4-mm grid covering the entire brain. A seed location 
was selected in left sensorimotor cortex and functional connectivity 
between the seed and the rest of the brain computed using an amplitude 
envelope correlation measurement, with correction for signal leakage by 
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regression. b, Regions exhibiting the strongest functional connectivity 
to the seed location (in the beta frequency band). Note that, in addition 
to a region around the seed, functional connectivity is observed in the 
homologous regions of the opposite hemisphere. This reflects long-range 
functional connectivity within the sensorimotor network. c, Functional 
connectivity strength between left and right primary sensorimotor cortex, 
plotted as a function of frequency. Note that, as expected, functional 
connectivity between these regions is greatest in the beta band (13-30 Hz). 
d, An example of beta band envelopes from the left (blue) and right 
(red) sensorimotor cortices, derived from resting state data. See also 
Supplementary Information section 7. 
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Extended Data Figure 7 | An overview of the OPM-MEG system. 

a, Schematic showing an overview of system hardware. b, Positioning of 
the reference sensors relative to the head to allow measurement of the 
three Cartesian components of the magnetic field, and the two dominant 
spatial gradients of the field. Each sensor provides measurements of two 


LETTER 


Ee B® _ B®) 


components of the magnetic field that are perpendicular to the beam 
axis. Both components were measured for field nulling, but during 
experimental measurements only the component of the field along the 
long-axis of the sensor was measured. 
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Extended Data Figure 8 | Crosstalk characterization across an OPM measured as the ratio of fields generated by the perturbing sensor and 
array. a, Schematic 3D representation of the crosstalk simulation. The the base sensor at the position of the base sensor. This ratio is a periodic 
head surface is shown with two example sensors. The locations of all function of sensor rotation about the radial orientation; the minimum 
13 sensors are also indicated. We sought to characterize crosstalk between —_ interaction is shown in b, the maximum is shown in c. d, Experimentally 
all pairs of sensors in the array. b, c, Simulated crosstalk between sensors measured crosstalk matrix. See also Supplementary Information section 8. 
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Extended Data Figure 9 | Coil designs. Wire paths and field plots are 
shown for the five coils: i, B,; ii, By; iii, Bs iv, dB,/dz; v, dB,/dz. The 
upper portion of each part shows the wire paths for one (1.6 x 1.6-m7) 
plane of the bi-planar coil. Red and blue colours indicate clockwise 
and anticlockwise circulation of the current. The lower portion shows 
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contours of the field or field gradient strength over the 0.4 x 0.4-m? 

x-y plane located at the centre of the volume of interest (z= 0). For v, 
contours are shown in the x-z plane at y= 0). The field or gradient values 
are normalized to the value at x = y =z=0. Variation from the ideal field 
distribution is less than 5% over a 0.4 x 0.4 x 0.4-m? central volume. 
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Extended Data Figure 10 | Removal of muscle artefacts via over-regularized beamformer (which is analogous to dipole fitting) (left) 
beamforming. a, The montage of OPM-MEG channels used to measure and unregularized beamformer (right). Note that for unregularized 
muscle artefact data. b, A beamformer image, highlighting a location of beamforming the muscle artefact is supressed effectively. See also 
interest in right sensorimotor cortex. c, The time-frequency response Supplementary Information section 9. 


for the best OPM-MEG sensor. d, Reconstructed responses from the 
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A TRP channel trio mediates acute noxious 


heat sensing 


Ine Vandewauw!”, Katrien De Clercq!?*, Marie Mulier!’, Katharina Held!*%, Silvia Pinto!?, Nele Van Rans 
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Andrei Segal!, Thierry Voet*, Rudi Vennekens!”, Katharina Zimmermann’, Joris Vriens*§ & Thomas Voets!?s§ 


Acute pain represents a crucial alarm signal to protect us from 
injury. Whereas the nociceptive neurons that convey pain signals 
were described more than a century ago’, the molecular sensors 
that detect noxious thermal or mechanical insults have yet to be 
fully identified*-*. Here we show that acute noxious heat sensing 
in mice depends on a triad of transient receptor potential (TRP) 
ion channels: TRPM3, TRPV1, and TRPA1. We found that robust 
somatosensory heat responsiveness at the cellular and behavioural 
levels is observed only if at least one of these TRP channels is 
functional. However, combined genetic or pharmacological 
elimination of all three channels largely and selectively prevents 
heat responses in both isolated sensory neurons and rapidly firing 
C and A6 sensory nerve fibres that innervate the skin. Strikingly, 
Trpv1~/—Trpm3~'~Trpa1 —~’~ triple knockout (TKO) mice lack the 
acute withdrawal response to noxious heat that is necessary to avoid 
burn injury, while showing normal nociceptive responses to cold 
or mechanical stimuli and a preserved preference for moderate 
temperatures. These findings indicate that the initiation of the acute 
heat-evoked pain response in sensory nerve endings relies on three 
functionally redundant TRP channels, representing a fault-tolerant 
mechanism to avoid burn injury. 

Nociceptors involved in acute heat sensing have cell bodies in the 
dorsal root and trigeminal ganglia and extend their sensory endings 
into the skin, mucosa and internal organs”. In these nociceptor end- 
ings, a rise in temperature induces the firing of action potentials, which 
propagate towards the dorsal horn or sensory nucleus in the brain, 
where the sensory information is relayed to second-order neurons’. 
Heat-activated nociceptors are characterized by the expression of the 
heat- and capsaicin-activated TRP channel TRPV1’. However, although 
ablation® or silencing’ of TRPV1-expressing neurons in mice result in 
almost complete absence of heat nociception, Trpv1-knockout mice 
display only minor deficits in acute noxious heat sensing’”"', indicating 
that heat-sensitive nociceptors express additional molecular heat sen- 
sors. Although various other ion channels, including several other TRP 
channels and calcium-activated chloride channels of the Anoctamin/ 
TMEM16 family*®, have been put forward as molecular heat sensors, 
mice deficient in these various channels showed either no or relatively 
mild deficits in acute heat sensing!?"}S, Thus, the molecular basis of 
acute heat sensing has remained unresolved’. 

The heat-sensitive TRP channel TRPM3 is expressed in a large subset 
of TRPV1-expressing sensory neurons, and Trpm3~/~ knockout mice 
exhibit reduced heat sensitivity at cellular and behavioural levels'*'°. To 
investigate the consequences of combined elimination of TRPV1 and 
TRPM3, we analysed the heat responsiveness of Trpm3~/~ Trpv1/— 
double knockout (DKOM*/Y) mice. Using calcium imaging, we probed 
the thermal and chemical responsiveness of sensory neurons isolated 
from the trigeminal ganglia in primary cultures. Whereas responses 
to the TRPM3 agonist pregnenolone sulfate and to the TRPV1 agonist 


capsaicin were largely eliminated, the proportion of heat-responsive 
neurons was reduced only slightly compared to that of wild-type 
mice, with approximately 40% of the DKO™?’¥! neurons still exhibit- 
ing a robust heat-induced calcium response (Fig. la; Extended Data 
Fig. 1a; see Methods for identification of heat responders). We used 
a skin-nerve preparation to measure action potentials upon stimu- 
lation of the receptive fields of single nerve fibres!’, and recorded 
robust heat-evoked responses in 33% (6/18) of the mechanosensitive 
C fibres from DKO™*/V! mice, as compared to 49% (17/35) in wild- 
type mice (P=0.38; Fisher's exact test; Extended Data Fig. 1b). At the 
behavioural level, DKO™?/Y! mice exhibited vigorous heat avoidance 
responses in the tail-immersion and hot-plate assays, although with- 
drawal latencies were prolonged compared to the wild type (Extended 
Data Fig. 1c). Thus, combined elimination of TRPM3 and TRPV1 
results in only very mild deficits in heat responsiveness, implying the 
existence of one or more additional heat sensors in somatosensory 
neurons. 

We observed that most residual heat-sensitive neurons in the 
DKO™V1 mice also responded to allyl isothiocyanate (AITC, 50 uM; 
Fig. 1a), a pungent chemical that at this concentration causes selective 
activation of TRPA1!*. Moreover, in wild-type heat-sensitive neurons, 
we found extensive overlap of functional expression of TRPA1 with 
both TRPM3 and TRPV1 (Extended Data Fig. 1d). To test whether 
TRPA| contributes to residual heat sensitivity in DKO™?/V! mice, we 
first evaluated the effect of the selective TRPA1 antagonist HC030031, 
and found that it caused an almost complete loss of heat-induced 
calcium responses in DKO™*/Y! sensory neurons (Fig. 1b, g). 
Accordingly, we measured heat-induced inward currents in a subset 
(9 out of 23) of DKO™*/V! sensory neurons; these currents were inhi- 
bited by HC030031 and correlated with cellular responsiveness to AITC 
(Fig. 1c, d). Next, we produced Trpv1~/~Trpm3~’Trpal~‘~ triple 
knockout (TKO) mice (Extended Data Table 1), and compared them with 
the different double knockout (DKO) mouse lines for these three TRP 
channels (DKOM*/Y!, DKOM*/A! (Trpm3 = ~Trpal ~~) and DKOV!4! 
(Irpv1~/ Trpal ~~) mice). Sensory neurons isolated from TKO mice 
showed an almost complete loss of heat responses, which contrasts 
sharply with the presence of more than 40% heat responders in wild- 
type mice and in the three DKO mouse lines (Fig. le-g). 

Several lines of evidence indicate that the loss of heat sensitivity in 
TKO neurons is specifically due to the combined loss of the three TRP 
channels. Most importantly, reintroduction via transient transfection of 
TRPV1, TRPM3 or TRPAI into TKO neurons restored sensitivity to heat 
and to the respective channel agonists (Fig. 2a—c, f). Conversely, heat 
responses in wild-type sensory neurons were suppressed by an inhibitor 
cocktail consisting of the TRPA1 antagonist HC030031, the TRPV1 
antagonist AMG9810, and the TRPM3 antagonist isosakuranetin 
(Fig. 2d-f). Together, these results indicate that TRPA1, TRPV1 
and TRPM3 have critical but redundant roles in heat transduction: 
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Figure 1 | TRPA1 mediates TRPM3- and TRPV1-independent heat 
responses. a, b, Responses to heat (45°C) and AITC (50|1M) in sensory 
neurons from DKO™*’V! mice in the absence and presence of the TRPA1 
antagonist HC030031 (100\1M). WTT, wild-type; PS, pregnenolone 

sulfate; Caps, capsaicin. c, Example of a HC030031-sensitive heat- 
activated inward current in an AITC-sensitive DKO™*! neuron. d, Mean 
(+ s.e.m.) current densities for experiments as in c (n = 9 cells). e, f, Heat 
and ligand responses in sensory neurons from TKO and indicated DKO 
mice. Scale bars ina, b, e, f, 60 s/100 nM. g, Percentages of heat-responding 
sensory neurons in mice of the indicated genotypes, in the absence or 
presence of HC030031. The experiments in a, b, e, fand summary data 

in g are representative of the following numbers (7) of tested trigeminal 
neurons from N mice: wild-type: n = 296, N= 6; DKOM¥/41: 1 = 613, N=5; 
DKOVYA!: 4 =725, N=7; DKOMY): 1 = 1,353, N=9; TKO: n=720, 
N=11; WT+HC030031: n = 208, N=3; DKO™*/Y! + HC030031: n= 293, 
N=3; DRG neurons from wild-type: n = 133, N=3; TKO: n= 181, N=3. 
**** P < 1.00001; Fisher’s exact test with Holm-Bonferroni correction. 


functionality of at least one of the three channels is necessary and suf- 
ficient to sustain robust heat responses in sensory neurons. 

Other aspects of sensory neuron biology were unaltered in the 
TKO mice. First, functional expression of the cold-activated channel 
TRPM8 was preserved in the trigeminal ganglia of TKO mice 
(Extended Data Fig. 2a—c). Compared to wild-type neurons, TKO neu- 
rons showed a reduction in the fraction of cold responders (Extended 
Data Fig. 2c), consistent with the contribution of TRPA1 to cold 
sensing’. Second, RNA sequencing (RNA-seq) analysis of isolated 
sensory ganglia from wild-type, TKO and the different single and dou- 
ble knockout mouse lines for TRPA1, TRPM3 and TRPV1 did not 
reveal any global changes in mRNA expression profile (Extended Data 
Fig. 2d). In particular, the mRNA expression levels of other channels 
implicated in heat sensing, and of a previously defined set of about 200 
operational components of sensory neurons””, were not significantly 
altered in the TKO mice (Extended Data Fig. 2e). Considering that 
heat-sensitive neurons constitute a large fraction of the total sensory 
neuronal population in wild-type mice (about 60%; Fig. 1g), these 
results argue against important alterations at the transcriptional level 
that would provide an obvious explanation for the striking deficit in 
heat sensing in the TKO neurons. 

A recent study”! identified a population of TRPM3- and TRPV1- 
negative heat-sensitive sensory neurons, and concluded that these 
neurons rely on TRPM2 for their heat responses. An important piece 
of evidence to support the involvement of TRPM2 was the finding 
that TRPM3- and TRPV 1-independent heat responses were enhanced 
by H,O2, a known TRPM2 activator. However, as TRPA1 is also sen- 
sitive to H,O,”*, TRPA1 may contribute to the H2O.-sensitive heat 
responses in sensory neurons. Using DKO™*/"! sensory neurons, we 
confirmed strong potentiation of TRPM3- and TRPV1-independent 
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Figure 2 | Properties of TKO sensory neurons. a-c, Heat- and ligand- 
induced calcium responses of TKO trigeminal neurons transfected with 
TRPV1, TRPM3 or TRPA1. d, e, Responses of wild-type trigeminal neurons 
to heat in the absence (d) and presence (e) of an inhibitor cocktail of the 
TRPV1 antagonist AMG9810 (5|1M), the TRPM3 antagonist isosakuranetin 
(5M) and the TRPA1 antagonist HC030031 (100 1M); after the heat 
stimulus and wash-out of the inhibitor cocktail, responses to the indicated 
agonists (pregnenolone sulfate, 401M; AITC, 501M; capsaicin, 1|1M) were 
conserved. Scale bars in a-e, 60s/100 nM. f, Percentages of heat-responding 
sensory neurons in transfected TKO neurons and in wild-type neurons, 

in the absence and presence of the inhibitor cocktail. The experiments in 
a-e and summary data in f are representative of the following numbers 

(n) of tested trigeminal neurons from N mice. TKO mock: n = 277, 

N=5; TKO + TRPV1: n= 130, N=5; TKO + TRPM3: n= 189, N=4; 
TKO + TRPAI: n= 289, N=6; wild-type control: n = 276, N=6; 

wild-type + inhibitor cocktail: n= 199, N=5. 


heat responses by H2O,, including in heat-insensitive neurons, which 
gained heat sensitivity in the presence of HzO, (Extended Data 
Fig. 3a, f). However, these potentiated heat responses occurred exclu- 
sively in AITC-sensitive neurons, and were fully suppressed in the 
presence of HC030031 (Extended Data Fig. 3a, b, f). Likewise, H.O2 
failed to enhance the heat responses in TKO neurons: heat-insensitive 
TKO neurons never gained heat sensitivity in the presence of H,O2 
(Extended Data Fig. 3c), and the few residual heat responders (about 
3%) among TKO neurons were not potentiated by HO, (Extended 
Data Fig. 3d, f). Notably, the residual heat responses in TKO neurons 
were also not affected by the TRPM2 antagonist 2-APB (Extended 
Data Fig. 3e). Overall, these data do not provide support for the 
notion that TRPM2 contributes significantly to heat responses in 
somatosensory neurons, and instead demonstrate that TRPA1 
mediates H,O>-sensitive, TRPV1- and TRPM3-independent heat 
responses in these cells. 

Consistent with earlier research’, we were unable to reliably mea- 
sure heat responses in non-neuronal CHO cells expressing mouse 
TRPAI (Extended Data Fig. 4a, d). However, about 40% of the CHO- 
TRPAI cells showed a robust heat-evoked Ca”* response following a 
brief preincubation with HO, (Extended Data Fig. 4b, d), and these 
heat responses were fully suppressed by HC030031 (Extended Data 
Fig. 4c, d), in line with a recent report showing that the tempera- 
ture sensitivity of TRPA1 is strongly dependent on the redox state”*. 
Together, our results indicate that the heat responsiveness of mouse 
TRPAI is highly dependent on the cellular environment: under con- 
trol conditions, expression of TRPA1 induces robust heat responses 
in sensory neurons (Figs 1a, c, d, 2c) but not in non-neuronal cell 
lines (Extended Data Fig. 4); in both cellular environments, TRPA1- 
dependent heat responses are strongly enhanced by HO, (Extended 
Data Figs 3, 4). In this context, a recent study of TRPA1 in the planarian 
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Figure 3 | Heat sensitivity of sensory fibres innervating the skin of TKO 
mice. a, Percentage of mechanosensitive C fibres innervating the paw skin 
from wild-type, DKO and TKO mice that show a response to heat (48 °C) 
(*P=0.041, **P=0.0057, #P = 0.037; Fisher’s exact test). b, Thermal 
threshold for activation of heat-sensitive C fibres. Horizontal lines in b, d, 
e indicate mean. c, Peak firing frequency of heat-sensitive C fibres during 
a heating stimulus. The dotted line at 23 Hz indicates the border between 
low frequency and high frequency firing (see Extended Data Fig. 5). 
*P=0.021, **P= 0.00086; one-way ANOVA with Tukey’s post-hoc test. 

d, e, Mechanosensitivity (d) and conduction velocity (e) are unaltered in 
C fibres from the different genotypes. Black dots overlaying TKO data 
points mark the heat-sensitive C fibres. f, Percentage of mechanosensitive 
A6é fibres in mice that show a heat response. *P = 0.05; Fisher’s exact test. 
ND, not done. Number of tested fibres: wild-type C, n = 35, Ad, n=31; 
DKOM¥Al C, n= 17, Ab, n= 14; DKOM/V! C, n= 18, Ab, n= 13; DKOY"A! 
C, n=10; TKO C, n=23, Ab, n=41. 


flatworm Schmidtea mediterranea (smed-TRPA1) demonstrated that 
H2O2 and reactive oxygen species are rapidly (within seconds) pro- 
duced as a result of heat exposure, leading to smed-TRPA1 activation 
and TRPA1-dependent acute heat avoidance”. Our results also do not 
exclude the possibility that heat-induced TRPA1 activation occurs 
downstream of an as yet unknown heat sensor. 

Next, we evaluated the heat responses of single C and Aé fibres 
innervating the skin of wild-type, DKO and TKO mice. The propor- 
tion of C fibres that responded to a heat ramp to 50°C was reduced in 
TKO mice (Fig. 3a). We further characterized the thermal response 
characteristics of heat-sensitive C fibres from wild-type, DKO and 
TKO mice, by analysing their thermal thresholds and peak firing rates 
(Fig. 3b, c). Overall, the five genotypes did not differ significantly 
with respect to the thermal threshold (P= 0.07, one-way ANOVA; 
Fig. 3b). The peak firing rates of C fibres showed a bimodal distri- 
bution, and, accordingly, heat-sensitive C fibres were classified as 
either high frequency or low frequency, depending on whether their 
peak firing rate was above or below 23 Hz, respectively (Extended 
Data Fig. 5). Out of 23 C fibres from TKO mice, none was classi- 
fied as high-frequency heat-sensitive, which compares to 23% high- 
frequency heat-sensitive C fibres in wild-type mice (8/39; P=0.021, 
two-tailed Fisher’s exact test) and between 11 and 40% in the different 
DKO mice (Fig. 3c). Notably, TKO C fibres did not differ signifi- 
cantly from wild-type or DKO C fibres with respect to mechanical 
thresholds and conduction velocities, indicating that the deficit in the 
TKO C fibres is specific for heat, without affecting general neuronal 
excitability (Fig. 3d, e). In addition to C fibres, a subset of mouse 
A6 fibres also exhibits responses to heat. Whereas approximately 
10% of wild-type Aé fibres respond to the heating ramp, no TKO 
Aé fibres out of 41 tested responded to heat (Fig. 3f). Histological 
analysis did not reveal any significant difference in the nerve bundles 
innervating the hindpaw skin of TKO mice (Extended Data Fig. 6). 
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Figure 4 | Selective loss of acute heat nociception in TKO mice. 

a, b, Withdrawal latencies of male mice in the hot-plate (a) and tail-flick (b) 
assays. Dotted lines indicate cut-off times. Results obtained at additional 
temperatures and statistical analyses are provided in Extended Data Fig. 7. 
Horizontal lines in a, b, d-g indicate mean. c, Percentage of tested animals 
(WT: N= 16; DKOM*/41, N= 12; DKOY"/4!: N= 15; DKOM?/V!: N= 12; 
TKO: N= 16) that did not withdraw their tail before the cut-off. 

d, Withdrawal latencies of wild-type (N= 13-14) and TKO (N= 15) mice 
to smooth plastic and jagged metal clips. P= 0.50 and 0.86, respectively; 
two-sided Student's t-test. e, Sensitivity of wild-type (N= 15) and TKO 
(N= 15) mice to calibrated von Frey hairs (P = 0.09; two-sided Student's 
t-test). Ts, 50% paw withdrawal threshold. f, Withdrawal latencies 

of wild-type (N= 19) and TKO (N= 12) mice in the cold plantar test 

(P= 0.76; two-sided Student's t-test). g, Number of icilin-induced wet 
dog shakes in wild-type (N= 12) and TKO (N= 11) mice (P=0.20; 
two-sided Student's t-test). h, i, Representative experiments depicting the 
positions of a wild-type (h) and a TKO (i) mouse on a thermal gradient. 
Border zones of the gradient are red (>45 °C) and blue (<10°C). j, Mean 
presence time + s.e.m. in the different temperature zones during the first 
and second 30-min periods (N= 10 mice for each genotype). k, Two- 
temperature choice test with control plate set at 30°C and test plate set at 
30°C or 45°C. Mean percentage + s.e.m. of the total time spent on the test 
plate (N=9 mice for each genotype; P=0.77 at 30°C, P=0.16 at 45°C; 
two-sided Student's t-test). Both genotypes showed statistically significant 
avoidance at 45°C (P=0.001; one-sample t-test versus 50%). 


Overall, these results indicate that combined ablation of TRPA1, 
TRPV1 and TRPM3 causes a striking deficit in the heat sensitivity 
of Aé and C fibres. 

Next, we compared TKO mice with wild-type mice and the different 
DKO mouse lines with respect to acute heat nociception. In both the 
tail-immersion and hot-plate assays, the TKO mice showed a severe 
deficit in heat avoidance behaviour, to such a degree that the major- 
ity of TKO animals reached the cut-off times without withdrawal 
response to a range of noxious temperatures over 45 °C (Fig. 4a-c; 
Extended Data Figs 7, 8a, b; Supplementary Video 1). In comparison, 
the different DKO mice all exhibited vigorous withdrawal responses 
to noxious heat, albeit with overall longer latencies in the case of the 
DKO™*Y! and DKOY"/“! mice (Fig. 4a—c; Extended Data Fig. 7). Note 
that, by using a cut-off latency set at four times the mean response 
latency of wild-type mice, we did not observe superficial signs of acute 
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skin burns immediately following the hot-plate or tail-immersion 
experiments in mice of any genotype. However, visual re-inspection 
of the animals three days after the 57°C tail-immersion experiments 
revealed severe tissue damage (scarring, necrosis) in the tails of all TKO 
mice, which was never observed in wild-type mice or any of the DKO 
mice (Extended Data Fig. 8d-f). Thus, TKO mice lack the acute heat 
response that is necessary to avoid burn injury. 

The absence of a rapid withdrawal response to noxious heat was not 
due to a general lack of sensitivity to aversive stimuli, as TKO mice 
showed wild-type-like pain responses in the tail clip assay (Fig. 4d), 
normal mechanosensitivity of their hindpaws in the von Frey assay 
(Fig. 4e; Extended Data Fig. 8c), and an unaltered withdrawal latency 
in the cold plantar assay”® (Fig. 4f). Likewise, the number of icilin- 
induced wet dog shakes, a measure of TRPM8-dependent cooling- 
induced behaviour, was not significantly different between wild-type 
and TKO mice (Fig. 4g). 

In addition to reflex avoidance responses to noxious cold or hot tem- 
peratures, mice exhibit thermotaxis behaviour towards their preferred 
ambient temperature, typically in the moderately warm (25-35 °C) 
range. To test whether TKO mice exhibited alterations in thermal 
preference, mice were allowed to move freely on a 1.2-m-long thermal 
gradient ranging from 5 to 50°C for 60 min. Wild-type and TKO mice 
showed extensive exploratory behaviour and exhibited a similar mean 
distribution on the gradient (Fig. 4h-j). Overall, we did not detect any 
significant differences between wild-type, TKO and the different DKO 
mice with respect to the total covered distance on the gradient or the 
peak occupied temperature (Thea; Extended Data Fig. 9). Mice of all 
genotypes explored the full length of the gradient, including the border 
zones (marked in blue (below 10°C) and red (above 45 °C) in Fig. 4h, i), 
and the combined time spent in these two zones was similar between 
genotypes (Extended Data Fig. 9c). However, whereas wild-type and 
the different DKO mice showed a preference for the blue (cold) versus 
the red (hot) border zone, this preference was not observed in the TKO 
mice (Extended Data Fig. 9a). 

In the two-temperature choice test, where animals were given the 
choice between a control plate at 30°C and a test plate at 45 °C, animals 
of all genotypes showed a similar preference for the control plate 
(Fig. 4k; Extended Data Fig. 9c). A striking difference between the 
genotypes was found when we analysed the timing of individual visits 
to the test plate: the duration of the longest visit to the 45°C plate was 
significantly longer for TKO mice than for wild-type or the different 
DKO mice (Extended Data Fig. 9b). A further experiment with the 
test plate set at 50 °C was prematurely terminated, as we observed that 
several TKO mice would stay for more than 60s on the test plate, 
risking burn injury. Together, these data demonstrate that thermal 
preference is largely preserved in TKO mice, despite the strong deficit 
in acute heat-induced pain. These results contrast with the phenotype 
of mice in which all TRPV1-positive neurons were ablated, which not 
only lack noxious heat responses but are also indifferent to tempera- 
tures between 30 and 50°C in the two-temperature choice assay*?”. 
The ability of TKO mice to discriminate temperatures in the 30-50 °C 
range may depend on the very small but consistent subset (about 3%) 
of TRPV1-, TRPM3- and TRPA1-independent heat-responsive sen- 
sory neurons (Fig. 1g; Extended Data Fig. 3), or on thermal stimu- 
lation of skin keratinocytes that then signal to sensory neurons via 
diffusible messengers”*. The subset of low-frequency heat-sensitive 
C fibres, which is conserved in the TKO mice (Fig. 3), are potential 
candidates to convey such keratinocyte-dependent signals. Finally, 
heat avoidance behaviour may be secondary to changes in core body 
temperature. In particular, the preoptic area of the hypothalamus 
contains temperature-sensitive neurons that detect small increases 
in local brain temperature and initiate both physiological heat loss 
mechanisms” and cold-seeking behaviour*”. 

Although the nociceptive neurons involved in acute pain signalling 
in mammals were described more than a century ago”, the molecu- 
lar mechanisms whereby these neurons detect harmful signals have 
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remained largely unresolved. The TKO mouse represents, to our 
knowledge, the first demonstration in mammals of elimination of 
the pain response to a noxious physical stimulus at the level of the 
transducer ion channels. The presence of three redundant molecular 
heat-sensing mechanisms with overlapping expression in nociceptor 
neurons represents a powerful fail-safe mechanism that ensures avoi- 
dance of noxious heat even under conditions where the function of 
one or two heat sensors is compromised, for instance due to channel 
desensitization or naturally occurring inhibitory ligands. We speculate 
that similar redundancy may operate in mammals for the detection of 
other noxious physical stimuli. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Animals. Wild-type C57BL/6J and Trpal~/~Trpm3~, Trpal~/~Trpv1~‘-, 
Trpm3~/~Trpv1~’~ and Trpal~/~Trpm3~/~Trpv1 ~~ mice on a C57BL/6J back- 
ground, aged 10-14 weeks, were used for all experiments. Male mice were used for 
behavioural experiments, unless stated otherwise; primary neurons and skin-nerve 
preparations were obtained from both male and female mice. Mice were housed 
in a conventional facility at 21°C on a 12-h light-dark cycle with unrestricted 
access to food and water. All experiments using animals were approved by the KU 
Leuven Ethical Committee Laboratory Animals under project numbers P021/2012 
and P265/2015. 

Isolated sensory neurons. After mice were killed by CO inhalation, trigeminal 
ganglia were excised, washed in neurobasal A medium (Invitrogen) supplemented 
with 10% fetal calf serum (basal medium), and then incubated for 45 min at 37 °C 
in a mix of 1 mg/ml collagenase (Gibco) and 2.5 mg/ml dispase (Gibco). Digested 
ganglia were gently washed twice with basal medium and mechanically dissociated 
by passage through syringes fitted with increasing needle gauges. Neurons were 
seeded on poly-L-ornithine/laminin-coated glass-bottomed chambers (Fluorodish, 
WPI) and cultured overnight at 37°C in 5% CO) in B27 (Invitrogen) supplemented 
neurobasal A medium, containing 2 ng/ml GDNF (Invitrogen) and 10 ng/ml NT4 
(Peprotech). For transfection experiments, trigeminal ganglia neurons were grown 
for 4 days before transfection with 0.5|1g cDNA encoding mouse TRPA1, TRPV1 or 
TRPM3 in the bicistronic pCAGGS-IRES-GFP vector, using Lipofectamine 2000 
(Invitrogen) as a transfection reagent. Neurons were kept at 37°C in 5% CO, for 
3 days after transfection before calcium imaging. 

Cell culture. CHO cells stably expressing mouse TRPA1 (CHO-A1 cells, kindly 
provided by A. Patapoutian) were cultured in DMEM containing 10% fetal bovine 
serum, 2% glutamax (Invitrogen), 1% non-essential amino acids (Invitrogen) and 
200 1g/ml penicillin/streptomycin at 37 °C in a humidity-controlled incubator with 
5% CO,”*. The cell line tested negative for mycoplasma contamination. Cell line 
authentication was not performed, but TRPA1 expression was verified based on 
AITC responsiveness. 

Calcium imaging. Changes in intracellular calcium concentration were monitored 
using ratiometric Fura-2-based fluorimetry. Cells were loaded with 21M Fura-2- 
acetoxymethyl ester (Alexis Biochemicals) for 30 min at 37°C. Fluorescence was 
measured during alternating illumination at 340 and 380.nm using either a CellM 
(Olympus) or Eclipse Ti (Nikon) fluorescence microscopy system, and absolute 
calcium concentration was calculated from the ratio of the fluorescence signals at 
these two wavelengths (R = F340/F3go) as [Ca?*] = Km X (R — Rmin)/(Rmax — R), 
where Ky; Rmin and Rmax Were estimated from in vitro calibration experiments with 
known calcium concentrations. The bath solution contained (in mM): 138 NaCl, 
5.4 KCl, 2 CaCl, 2 MgCl, 10 glucose, and 10 HEPES, pH 7.4. Allylisothiocyanate, 
pregnenolone sulfate, and capsaicin were obtained from Sigma-Aldrich and dis- 
solved in bath solution from a 1,000 stock solution in DMSO. At the end of each 
experiment, cells were subjected to a depolarizing solution containing 50 mM KCI, 
and non-responsive cells were excluded from analysis. 

The following procedure was used in sensory neurons to distinguish stimulus- 
induced responses from background variations in calcium concentration or 
calcium-independent temperature-induced changes in Fura 2 fluorescence. We 
calculated the time derivatives of the calcium concentration, F349 and F3g9 as well 
as the s.d. of these derivatives in the absence of any stimulus. A positive response 
was noted when the following criteria were simultaneously met: 1) the stimulus 
caused an increase of d[Ca?*]/dt exceeding 3 x s.d.; 2) dF349/dt > 3 x s.d.; and 3) 
dF3g0/dt <3 x s.d. 

For CHO-A\1 cells, a positive heat response was defined as a response that 
exceeded the mean response in the presence of HC030031 by at least five times 
the s.d., as described elsewhere”’. 

Whole-cell patch-clamp recordings. Membrane currents in isolated sensory neu- 
rons were measured in the whole-cell configuration at a constant holding poten- 
tial of -40 mV using an EPC-10 amplifier and PatchMaster Software (HEKA 
Elektronik). The extracellular solution contained (in mM): 140 NaCl, 4 KCI, 
2 CaCh, 2MgCl, 10 HEPES (pH 7.4 with NaOH), supplemented with 100 nM 
tetrodotoxin (TTX) to block voltage-gated sodium channels. The internal solu- 
tion contained (in mM): 140 CsCl, 1 MgCh, 1 EGTA, 10 HEPES and 5 tetraethyl- 
ammonium chloride (pH 7.2 with CsOH). 

Skin-nerve recordings. Single-fibre recordings from cutaneous C and Aé fibres 
of the saphenous nerve were obtained using an isolated skin-nerve preparation as 
described previously’”. A skin flap of the hind paw hairy skin and the innervating 
saphenous nerve were excised and fixed in an organ bath, corium side up. The 
chamber containing the skin tissue was continuously superfused with pre-warmed 
(30°C) synthetic interstitial fluid (SIF) solution, consisting of (in mM) 107.8 
NaCl, 3.5 KCl, 26.2 NaHCOs, 1.67 NaH2POu, 9.64 Na-gluconate, 0.69 MgSOu, 7.6 
sucrose, 5.05 glucose, and 1.53 CaCl;. The pH was buffered to 7.4 using carbogen 
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(95% oxygen and 5% carbon dioxide). The nerve trunk was placed on a mirror in 
a separate chamber filled with paraffin oil, desheathed and divided under micro- 
scopic control until a unit with a single distinguishable receptive field was obtained. 
Initially, mechanosensitive receptive fields were mapped using a blunt plastic rod. 
Once a distinct receptive field was identified, electrostimulation was applied to the 
receptive field through a metal microelectrode to determine the fibre’s conduction 
velocity. Values of conduction velocity were used for fibre classification with a cut- 
off criterion of <1.4m/s for unmyelinated C fibres. A marking technique, in which 
latency shifts are provoked through simultaneous application of a mechanical and 
electrical stimulus to the receptive field, was then applied, ensuring recording from 
a distinct single fibre. To assess the sensory properties of these mechanically sen- 
sitive fibres and for further classification into subclasses, the mechanical, cold and 
heat sensitivity of the individual fibres was tested. First, the mechanical threshold 
of the fibre was characterized using calibrated von Frey monofilaments ranging 
from 1 mN to 128 mN in a geometric scale. Subsequently, a rounded plastic ring 
(~8 mm diameter) was placed above the receptive field. The volume of fluid within 
the ring was evacuated and cold (by fluid replacement with cold buffer at 4°C), 
followed by radiant heat (ramp of 30-50°C delivered by a halogen lamp with an 
8-mm focus beam) stimuli were applied. Heat thresholds were taken as the tem- 
perature at which the second spike occurred. Data were acquired using the Dapsys 
data acquisition system (B. Turnquist, Bethel University, USA). 

Behavioural tests. Considering that we tested animals of five different genotypes, 
with a priori unknown somatosensory properties, we were not able to perform a 
reasonable power analysis to determine the sample size for behavioural experi- 
ments. Therefore, sample sizes were chosen to be consistent with those reported 
in similar behavioural studies. In nociceptive assays, mice from different geno- 
types were tested for all the different stimuli, so no randomization of experimental 
groups was necessary. 

In the tail-immersion assay, mice were immobilized in aluminium foil, which 
allowed free tail movement. The tip of the tail (one-third of the length) was 
immersed in a water bath maintained at 45, 48, 50, 52, or 57°C. The latency to 
a nociceptive response (withdrawal of the tail) was measured. To prevent acute 
tissue damage, the tail was removed from the bath immediately after a nociceptive 
response or upon reaching a cut-off time, which was set at four times the mean 
response latency of wild-type mice. 

In the hot-plate assay, mice were individually confined in a Plexiglas chamber 
ona metal surface set at 50, 55 or 58°C and the latency to a nociceptive response 
(licking or shaking of hind paws, jumping) was measured. To prevent tissue 
damage, mice were removed from the hot plate after a nociceptive response or 
upon reaching a cut-off time of three times the mean response latency of wild- 
type mice. 

In the thermotaxis assay, mice were individually tracked for 60 min in a thermal 
gradient apparatus (Bioseb), which consisted of a controlled and stable temper- 
ature gradient from 5 to 50°C on an aluminium floor. Tracking was performed 
using a video camera-based system and the time spent in two-degree zones was 
determined. 

In the two-temperature choice assay, mice were placed in a chamber containing 
two identical, adjacent floor platforms (Bioseb) with one set to 30°C and the other 
set to 45 or 50°C. Mice were free to explore for 10 min, and the total time spent on 
each surface and the number of times an animal moved to an adjacent platform 
were analysed. 

In the tail clip assay, either a plastic mini-peg (Maped) or metal crocodile clamp 
(Teishin, Japan), exerting a force of 5 N, was applied to the root of the tail, and the 
latency to a nociceptive response was measured. 

In the Von Frey test, mice were individually placed in a chamber with a wire 
mesh floor, and allowed to habituate for 15 min before testing. Mechanical 
responses were tested by stimulating the middle plantar surface of the hind paw 
with von Frey monofilaments using the up-and-down method, starting with 1 g. 
Biting, licking, or withdrawal during or immediately following the 3-s stimulus 
were considered as a positive response. 

In the rotarod assay, mice were initially placed on the stationary rod. After habit- 
uation, rotation was started at 4 r.p.m. (rotations per minute), accelerating overa 
5-min period to 40 r.p.m. This was done in three sessions during four trial days 
and one test day. The time taken for the mouse to fall from the rod was recorded. 

In the elevated maze test, mice were placed in a plus-shaped apparatus with 
two open and two enclosed arms, connected by a central platform, which was 
elevated from the floor. Mice were individually placed in the centre of the maze and 
observed for 5 min. The time spent in the open and enclosed arms was recorded 
using ANY-maze tracking software (Stoelting). 

To evoke icilin-induced wet dog shakes, mice were intraperitoneally injected 
with 50 mg/kg icilin, which induces robust shivering and shaking in mice. The inci- 
dence of these wet dog shakes was counted for 20 min after icilin administration. 
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In the cold plantar test, mice were acclimated on a 4-mm-thick glass plate, and 
then the withdrawal latency was measured when a compressed dry ice pellet was 
held against the glass surface underneath the hindpaw. The latency to withdrawal 
was tested three times for each hindpaw with an interval of ~15 min, and averaged 
to obtain a single value per mouse”®. 

RNA-seq. After mice were killed by CO; inhalation, DRGs were isolated and kept 
in RNAlater Stabilization Reagent (Qiagen) at —20°C until further processing. 
Total RNA was extracted using the RNeasy Mini Kit (Qiagen). The sequencing 
libraries were prepared according to the standard Illumina TruSeq mRNA stranded 
sample preparation protocol and each library was sequenced on an Illumina HISeq 
2500 according to the manufacturer’s recommendations, generating 50-bp single- 
end reads at the Genomics Core Leuven. Reads were aligned using the open-source, 
splice-aware tool Tophat (v2.0.13). Read counts per feature (genes or exons) were 
done by HTSeq (v0.5.3p3). Count data were analysed to identify differentially 
expressed genes using DESeq? (v1.01.1) at FDR 10%. 

In situ hybridization. Mice were transcardially perfused via the left ventricle 
with 25 ml PBS followed by 25 ml neutral buffered formalin (10% (w/v), pH 7.4). 
Subsequently, trigeminal ganglia were dissected and immersion-fixed in neutral 
buffered formalin for 6h. Thereafter, trigeminal ganglia were transferred to PBS for 
1h, and then to 70% ethanol until paraffin embedding. Paraffin-embedded tissue 
was cut into 4-\1m-thick sections using an RM2235 rotary microtome (Leica), 
and probed with digoxigenin-labelled antisense Trpm8 RNA probes. Antisense 
RNA probes were generated using the MAXIscript SP6/T7 Transcription Kit 
(ThermoFisher Scientific), using a 204-bp cDNA fragment of mouse Trpmé8 (acces- 
sion number NM_134252.3) corresponding to nucleotides 1779-1982 (exons13 
and 14). Hybrid molecules were detected using the TSA Plus Fluorescein Kit 
(Perkin Elmer) according to the manufacturer’s instructions. 

Nerve bundle anatomy. The plantar skin of the mouse hind paw was fixed in 4% 
paraformaldehyde and further processed for staining. Four-micrometre sections 


were incubated for 30 min in HO> (3%) and antigen retrieval was performed in 
0.01 M citrate buffer (pH 6, 90°C, 1h). Slides were gradually cooled down and incu- 
bated in blocking buffer containing 2% bovine serum albumin (Sigma-Aldrich), 1% 
non-fat dry milk (Nestlé), 0.1% Tween (Merck), complemented with 1:25 normal 
goat serum (X0907, Dako), before overnight incubation with the primary antibody 
(4°C, 1jg/ml rabbit polyclonal anti-PGP9.5, Z5116, Dako). Slides were washed 
and blocked again with blocking buffer (15 min), after which they were incubated 
with the biotinylated swine anti-rabbit secondary antibody (30 min, 1:400; Z0196, 
Dako) supplemented with normal mouse serum (1:25; Dako). Staining was initi- 
ated by the addition of horseradish peroxidase-conjugated streptavidin (30 min, 
1:1,000; P0397, Dako) followed by 3-amino-9-ethylcarbazole substrate-chromogen 
(25 min). Sections were counterstained with Maeyer haematoxylin and mounted in 
glycerin jelly. For each mouse, the nerve bundle areas of 20 sections were evaluated 
with an average distance of 0.1 mm between the cross sections. 

Statistical analysis. Data analysis was performed using Origin software (v8.6-9.0; 
OriginLab). Group data are represented as mean +s.e.m. from n cells or fibres or N 
animals. Tests used for statistical comparison between groups are indicated in the 
text and figure legends. Whenever parametric statistics were performed, normality 
was assessed using the Shapiro-Wilk test and variances were checked to ensure that 
they were similar between groups being compared. If conditions for parametric 
tests were not met, non-parametric tests were used. The investigators were not 
blinded to allocation during experiments. Analysis of data from calcium imaging, 
skin-nerve and behavioural experiments was performed without knowledge of the 
genotype of the cells, tissues or mice. 

Data availability. Sequencing data that support the findings of this study have 
been deposited in the NCBI Sequence Read Archive (SRA) and are accessible 
under the BioProject ID PRJNA431738. All other relevant data sets are included 
within the article or are available from the corresponding authors on reasonable 
request. 
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Extended Data Figure 1 | Heat sensitivity is preserved in Trpm3~/~Trpv1~/— 


double knockout (DKO™*/Y!) mice. a, Percentages of wild-type 

(296 neurons from 6 mice) and DKO™*’Y! (1,353 neurons from 9 mice) 
trigeminal neurons responding to the indicated stimuli. b, Single-fibre 
nerve recording of a mechano- and heat-sensitive C fibre innervating 
the skin of a DKO™*/Y! mouse. c, Withdrawal latencies of wild-type 
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(N= 10) and DKOM*Y! (N=7) mice in the tail-flick (at 57 °C) and hot- 
plate (at 50°C) assays. Dotted lines indicate the cut-off time. Horizontal 
lines indicate mean. d, Venn diagram showing the level of functional 
coexpression of TRPM3, TRPV1 and TRPA1 in wild-type trigeminal 
neurons, based on responsiveness to PS, Caps and AITC (n = 662 neurons 
from 6 mice). 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved 


LETTER 


a 


cold menthol igilin K 


& 25 
Y 20 
s 
< 15 
[e) 
= 10 
() 
er 5 
0 
cold menthol 
d 100000 
@ TRPM3 
10000 A TRPA1 
w TRPV1 
@ m TRPM2 
® 1000 A TRPV2 
ra @ TRPV3 
Q @ TRPV4 
- 400 © ANO1 
x ORAI1 
w STIM1 
104 
e 10 100 1000 10000 100000 
WT (Reads) ” = c 
2 [e) S a 
3 0) o 2£ a > iS 
= = : 2 8 g 8 5 ‘2 Expression 
5 San Foe geg © = 
S aS &S g@ 222 5a 8  B (log, fold vs WT) 
Oo = -o n= © 0 S&F «ee FB e) 
> co Qa po o SE oo © fo) Ss = 
eases ea 2 ee dd 22a S56 @ 3 2.000 
rF oczOsz a ¥~ozZz 2000 O00 F€£ Zz , 
moore AR Re Oe ir a eS 1.500 
f- 
Trpat” | II | | Im | | 8 hee 
Trpm3” | 0.5000 
Trpvt” 0.000 
pKO*™s | || | -0.5000 
ee = 
DKO™™' | | | 2.000 
TKO | = 


Extended Data Figure 2 | TRPM8 expression and RNA-seq analysis 
of sensory neurons. a, In situ hybridization using a TRPM8-specific 
probe in trigeminal ganglia of wild-type and TKO mice. b, Example of a 
sensory neuron from a TKO mouse responding to cold (15°C) and the 


TRPM8 agonists menthol (50 |1M) and icilin (11M). Scale bar, 6s/100 nM. 


c, Percentages of cold-responding and menthol-responding neurons in 
wild-type (1 = 225 from 3 mice) and TKO mice (n= 189 from 3 mice). 
** P = 0.0089; Fisher’s exact test. d, Transcriptome-wide comparison of 


mRNA expression between wild-type and TKO mice. Several ion channels 
that have been previously implicated in heat sensing are indicated. e, Heat 
map showing the differential expression levels in single, double and triple 
knockout mice of a set of about 200 genes implicated in somatosensation”®. 
Expression levels are defined as the logy fold change compared to the wild 
type. Except for Trpm3, Trpv1 and Trpal, no genes showed more than 50% 
up- or downregulation of expression in TKO compared to wild-type mice. 
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Extended Data Figure 3 | Modulation of heat responses by H20>: 

the role of TRPAI. a, Representative examples of responses to heat, in 
the absence and presence of H2Oz, in an AITC-sensitive and an AITC- 
insensitive sensory neuron from DKO™/Y! mice. b, Example of the 
suppression of heat responses in an AITC-sensitive DKOM*’Y! neuron 
by the TRPA1 antagonist HC030031 (100 1M). c, Heat-insensitive TKO 
sensory neurons are also insensitive to HzO. (n= 147 from 4 mice). 

d, e, Heat responses in TKO sensory neurons (which occur in only about 
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4 mice). Scale bars in a—e, 60s/100 nM. f, Average heat-induced increases 
in intracellular calcium during the three consecutive heat responses in the 
protocol shown in a-d, for the indicated subtypes of neurons. Number 

of tested trigeminal neurons for the data in f: DKO™V! n= 308 from 
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Extended Data Figure 4 | HO, potentiates heat responses in CHO As a conservative estimate of TRPA1-dependent heat responses, cells 
cells expressing mTRPA1. a, Representative examples of responses to with a heat response larger than 5 s.d. above the mean of responses in the 
heat (45°C), AITC (501M) and the ionophore ionomycin (2 1M) in presence of HC030031 (dotted red line) were counted. Corresponding 
CHO cells stably expressing mouse TRPA1. b, c, Potentiation of the percentages are indicated. The HzO) group was significantly different from 
heat response by H2O> (100 1M) and inhibition by HC030031 (100M). the two other groups (***P < 0.00001; two-sided Mann-Whitney test with 
d, Population analysis of all tested cells corresponding to the experiments Bonferroni correction). 


in a—c. Control, n = 394; H2O2, n = 560; H.O2 + HC030031, n = 103 cells. 
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Extended Data Figure 6 | Nerve bundles in the skin. a, PGP9.5 staining 
of the plantar hind paw skin, representative of wild-type (N=5) and TKO 
(N=5) mice. Scale bar, 20\1m. Black arrows indicate nerve bundles. b, For 
each mouse, the nerve bundle areas of 20 sections were evaluated with 
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WT TKO 
an average distance of 0.1 mm between the cross sections. No significant 
difference was found between the two genotypes. P= 0.08; two-sided 
Student’s t-test. Group data are represented as mean + s.e.m. 
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Extended Data Figure 7 | Acute heat sensing at different temperatures. 
a-d, Withdrawal latencies for male mice of different genotypes in the 
tail-immersion assay at the indicated temperatures. The cut-off (dashed 
line) was set at four times the mean withdrawal latency of wild-type mice. 
Number of tested animals: wild-type, N= 10; DKO™*/A!l_ N= 6; DKOVA!, 
N=9; DKOM?'V! N=7; TKO, N=9. Horizontal lines indicate means. 

e, P values for pair-wise comparisons of the withdrawal latencies of the 
different genotypes. A two-way ANOVA was performed with genotype 
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DKO™*' 
DKOY"" 
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and bath temperature as factors. P values represent the result of Tukey’s 
post-hoc tests for the factor genotype, and thus quantify global statistical 
differences in latency over the five tested temperatures (45, 48, 50, 52 and 
57°C). Similar results, yielding P < 10~‘ for the comparison of TKO mice 
with either wild-type mice or mice of the different DKO genotypes, were 
obtained with a second independent cohort of male mice (5-6 mice per 
genotype). 
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Extended Data Figure 8 | Acute heat sensing and burn injury. 

a, Withdrawal latencies of female wild-type (N= 10) and TKO (N= 10) 
mice in the tail-immersion assay at the indicated temperatures. The 
cut-off (dashed line) was set at four times the mean withdrawal latency 
of wild-type mice. Indicated P values were obtained using a two-sided 
Student’s t-test. b, Withdrawal latencies of female wild-type (N= 10) and 
TKO (N= 10) mice in the hot-plate assay at the indicated temperatures. 
The cut-off (dashed line) was set at three times the mean withdrawal 
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latency of wild-type mice. Indicated P values were obtained using two- 
sided Student's t-tests. c, Sensitivity of female wild-type (N= 10) and 
TKO (N= 10) mice to calibrated von Frey hairs (P= 0.86; two-sided 
Student’s t-test). d, e, Photograph of the tails of male mice of the indicated 
genotypes (d) and close-up of the tails of five TKO mice (e) taken 3 days 
after a 57°C tail-flick experiment. f, Percentage of mice of the indicated 
genotypes exhibiting visual signs of burn injury 3 days after a 57 °C tail- 
flick experiment (five mice per genotype). 
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Extended Data Figure 9 | Thermal preference. a, Preference index for of the longest uninterrupted visit to the test plate. ***P=7.7 x 10-°, 
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Extended Data Table 1 | General properties of TKO mice 


Wild type TKO P value 
Body weight (g) at 12 weeks (males) 25.1+0.5(N=14)  27.0+0.9 (N=13) P=0.45 
Body core temperature (°C) 37.1+0.2(N=10)  37.940.3 (N=10) P=0.02 
Rotarod (maximal speed in rpm) 12.9 + 1.7 (N=6) 14.4 + 1.8 (N=6) P=0.56 
Elevated maze - time in open arms (%) 37.743.5(N=16)  35.9+6.5 (N=16) P=0.81 
Elevated maze - distance in open arms (%) 37.2+3.0(N=16) 32.9+6.4 (N=16) P=0.55 


Data are expressed as mean +s.e.m. from N mice. Indicated P values were obtained using a two-sided Student's t-test. 
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Electromechanical vortex filaments during cardiac 


fibrillation 


J. Christoph!?*, M. Chebbok?*, C. Richter), J. Schréder-Schetelig!, P. Bittihn®, S. Stein’, I. Uzelac®, F. H. Fenton’, 


G. Hasenfuf\2"*, R. F. Gilmour Jr’ & S. Luther)?:2/8:9-10 


The self-organized dynamics of vortex-like rotating waves, which 
are also known as scroll waves, are the basis of the formation of 
complex spatiotemporal patterns in many excitable chemical 
and biological systems!~*. In the heart, filament-like phase 
singularities®® that are associated with three-dimensional 
scroll waves’ are considered to be the organizing centres of life- 
threatening cardiac arrhythmias’ !>. The mechanisms that underlie 
the onset, maintenance and control!4~!° of electromechanical 
turbulence in the heart are inherently three-dimensional 
phenomena. However, it has not previously been possible to 
visualize the three-dimensional spatiotemporal dynamics of scroll 
waves inside cardiac tissues. Here we show that three-dimensional 
mechanical scroll waves and filament-like phase singularities can 
be observed deep inside the contracting heart wall using high- 
resolution four-dimensional ultrasound-based strain imaging. We 
found that mechanical phase singularities co-exist with electrical 
phase singularities during cardiac fibrillation. We investigated 
the dynamics of electrical and mechanical phase singularities by 
simultaneously measuring the membrane potential, intracellular 
calcium concentration and mechanical contractions of the heart. 
We show that cardiac fibrillation can be characterized using 
the three-dimensional spatiotemporal dynamics of mechanical 
phase singularities, which arise inside the fibrillating contracting 
ventricular wall. We demonstrate that electrical and mechanical 
phase singularities show complex interactions and we characterize 
their dynamics in terms of trajectories, topological charge and 
lifetime. We anticipate that our findings will provide novel 
perspectives for non-invasive diagnostic imaging and therapeutic 
applications. 

Spiral waves are found ubiquitously throughout nature“. These 
spiral waves form dynamically in many chemical and biological systems 
around two- or three-dimensional centres of rotation, which consist of 
singular points or filament-like lines of phase singularity”, respectively. 
In the heart, spiral waves are associated with cardiac fibrillation”®, as 
electrical vortex-like rotating waves or rotors can be observed on the 
surface of the heart during arrhythmia’. The core regions of these 
rotors, which can be displayed as phase singularities using fluorescence 
imaging of membrane voltage or calcium'*"%, are thought to expand 
into the heart muscle where they form vortex filaments that are asso- 
ciated with three-dimensional (3D) electrical scroll waves®. However, 
visualization of the 3D wave phenomena that occur within the cardiac 
muscle has remained a major scientific challenge. Despite substantial 
progress in the development of tomographic optical techniques!”~, the 
measurement of transient electrical scroll waves inside cardiac tissue 
has so far been impossible. Because vortex filaments are considered to 
be the organizing centres of fibrillation'?”’, insights into their dynamics 


inside the heart wall are thought to be key to understanding the nature 
of cardiac fibrillation’®"! and to the conceptualization and advance- 
ment of novel techniques for the control of heart rhythm disorders'*""*. 

Here, we present direct and fully time-resolved measurements of 3D 
mechanical scroll waves inside the contracting cardiac muscle of intact, 
Langendorff-perfused pig hearts. To elucidate the dynamics of electrical 
and mechanical wave phenomena in the heart, we combined panoramic 
fluorescence imaging (optical mapping) with high-resolution four- 
dimensional (4D) ultrasound imaging (Fig. 1 and Extended Data 
Figs 1-3). The rapid mechanical contractions within the entire 
ventricles during ventricular fibrillation were measured using fast 4D 
ultrasound imaging with a spatiotemporal resolution of 0.5-1.0 mm 
(field of view, approximately 10 x 10 x 10cm) and 50-188 Hz volumes 
per second (see Methods). Tissue deformation (strain) was calculated 
from the 4D ultrasound data, revealing the propagation of 3D mechani- 
cal waves of contracting and dilating tissue throughout the ventricular 
muscle during regular rhythm and cardiac arrhythmias (Fig. 1a, d, 2a). 
From the amplitude of the mechanical deformation wave, the corre- 
sponding phase was computed. We induced ventricular tachycardia 
and ventricular fibrillation inn =5 pig hearts (see Methods). During 
ventricular tachycardia, we observed a single stable 3D mechanical 
scroll wave within the left-ventricular wall (Supplementary Video 1). 
The phase of this 3D wave (Fig. 1a, d and Extended Data Fig. 1 a, b) 
revealed the formation a U-shaped vortex filament, which extends from 
the epicardial surface to deep inside the left ventricular wall and back. 
On the surface of the heart, the filament ended in two phase singular- 
ities with opposite topological charges, corresponding to two stable 
counter-rotating mechanical spiral waves. During ventricular fibril- 
lation, drifting U-shaped mechanical filaments were observed during 
ventricular fibrillation (Fig. 2a). In this example, the breakthrough of 
the filament through the epicardial surface resulted in the formation of 
two transient I-shaped filaments and two corresponding counter-rotat- 
ing, outwardly drifting mechanical scroll waves (Supplementary Video 
2 and Extended Data Fig. 1c). The formation, transition and extinction 
of U-shaped, I-shaped and O-shaped mechanical filaments was also 
observed during ventricular fibrillation (Fig. 2b), phenomena similar 
to those predicted by models for electrical filaments*. The mechanical 
wave phenomena that were obtained from tissue deformation analyses 
using ultrasound imaging were compared to the electrical activation and 
mechanical motion data that were obtained from simultaneous multi- 
modal optical mapping of the surface of the heart (see Supplementary 
Video 9 and Extended Data Figs 4d, e, 9). The optical mapping data 
confirmed the co-existence of rotating waves of mechanical deforma- 
tion and electrical activation, which appeared simultaneously within 
the same region of the heart and exhibited the same rotational sense 
and rotation period, and retained a similar morphological shape 
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Figure 1 | Measurement of intramural mechanical filament-like phase 
singularity. a, Double scroll wave (phase) and U-shaped mechanical 
filament (red) recorded at 134 volumes per second during ventricular 
tachycardia inside the left ventricular wall (LVW) of a contracting 
Langendorff-perfused pig heart using 4D ultrasound (Supplementary 
Video 1 and Extended Data Fig. 1 a, b). LV, left ventricle; RV, right 
ventricle; RVW, right ventricular wall; PS, phase singularity. b, Figure- 
of-eight spiral wave (membrane voltage) on the surface of the left and 
right ventricle with two phase singularities obtained by recording 


(Fig. 2a and Extended Data Fig. 9c). In general, we found that the 
cardiac muscle exhibits deformations during cardiac fibrillation” 
that retain vortex-like patterns and, like fingerprints of electrical 
vortex activity, reveal the topological organization of cardiac fibrilla- 
tion throughout the heart wall. Furthermore, we found that rotational 
centres of the mechanical rotor patterns can be displayed as phase 
singular points or filament-like lines of phase singularity, as has 
been shown previously for electrical activity'”'? (see Supplementary 
Videos 1, 2, 6-8, 11). 

To further elucidate the dynamics of electrical and mechanical 
vortex filaments, we conducted computer simulations of scroll waves in 
excitable and contractile cardiac tissue (see Methods). Figure 2c, d show 
simulation data of electromechanical vortex filament dynamics using 
realistic heart geometry and fibre orientation. Our simulations indicate 
that electromechanical turbulence in the heart is composed of entangled 
pairs of co-existing electromechanic vortex filaments. The filaments 
emerge in close vicinity to each other due to electromechanical 
coupling (Supplementary Video 3). The simulations suggest that the 
experimentally observed mechanical filaments inside the ventricular 
wall may be intrinsic dynamical entities that can be used to characterize 
the spatiotemporal organization of cardiac fibrillation. 

To investigate the dynamics and interaction of electrical and 
mechanical rotors experimentally, we have developed a multi-modal 
fluorescence imaging system, which allows for the simultaneous meas- 
urement of membrane voltage, intracellular calcium and mechanical 
strain on the surface of the heart (Fig. 3b-d). Standard optical mapping 
is very sensitive to mechanical motion of tissue, which may result in 
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Front view § 5.1 Hz 


Top view 


optical mapping data at the same time as the recording shown in a. The 
ultrasound transducer is indicated by (4); the white lines indicate the field 
of view. c, Schematic of the experimental setup. (1) temperature-controlled 
perfusion bath; (2) Langendorff-perfused pig heart; (3) high-speed 
EMCCD(electron-multiplying charge-coupled device) cameras (only two 
shown) and LED illumination (not shown); (4) 4D ultrasound transducer; 
(5) acoustic window (membrane); (6) 3D field of view of ultrasound 
transducer. See the Methods for details. d, Detail (front and top view) of 
the U-shaped mechanical filament (red) of the scroll wave shown in a. 


substantial distortions of the signal. To avoid these motion artefacts7!, 
pharmacological excitation—-contraction uncoupling agents have been 
applied in previous experiments to inhibit any contractile motion of the 
heart. By contrast, we used computer vision techniques to disentangle 
fluorescence signals (voltage, calcium) from mechanical deforma- 
tion, which not only overcomes the limitations imposed by motion 
artefacts22-2°, but also enables quantitative motion analysis 
(Supplementary Videos 4, 5). Figure 3b illustrates the marker-free 
motion-tracking algorithm. The algorithm estimates the displace- 
ment of the epicardial surface from a reference configuration to its 
instantaneous configuration at any subsequent time ft. This transfor- 
mation of coordinates is used to convert the fluorescence signal from 
the stationary camera frame of reference into the co-moving frame 
of reference. Within the co-moving frame of reference, fluorescence 
signals are essentially artefact-free. The efficacy and quality of the 
motion tracking may be assessed from the examples shown in Fig. 3c. 
The original fluorescence data show that the signal is substantially 
distorted by tissue motion, rendering a quantitative analysis impos- 
sible. However, within the co-moving reference frame, artefacts are 
significantly reduced, allowing the voltage and calcium signals to be 
reliably retrieved. Simultaneously, non-rigid motion tracking provides 
a measurement of the tissue coordinates and local deformation of the 
cardiac tissue as a function of time. An example of a simultaneous 
measurement of membrane voltage, calcium and the corresponding 
strain rate is shown in Fig. 3d (Supplementary Video 8). The time series 
show that the tissue experiences deformations that correlate with the 
sequence of electrical activation. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


on 
= 
® 
= 
= 
® 
Q 
x 
in 
Q 
2 oO 
ee 
| 
$3 
aE 
= | 
oa 
<x 
co) 
° 
J 
P< r t 
=| 7K ~ 
n” ta 7 
c , 
120 
rc] = 
= E 
E : : 
-80 Q 
f= 
Dilating o. 
i © rae 
2 w 
-» 
Rat Et 
2 3 5 
3 2 ae 
£— Contracting 
a 
d 120 
= 
= 
a 
~¢ 
-80 


= Dilating 


Strain rate 


Contracting 


Figure 2 | Electromechanical phase singularity in the experiments and 
simulations. a, Breakthrough of the U-shaped intramural mechanical 
filament (red) on the epicardial surface (LVW, pig) results in the formation 
of two transmural filaments during ventricular fibrillation and two phase 
singularities (circles) with opposite charges (counter-rotating spirals) 

on the surface. Optical mapping (voltage and strain rate) confirms 

the breakthrough of filament that is observed with 4D ultrasound 
(Supplementary Video 2). b, Creation and extinction of mechanical 
filaments (red) inside pig ventricles (grey shaded) between endocardial 


Applying this optical mapping system to ventricular fibrillation, we 
observed simultaneously co-localized rotors in the action potential, 
calcium and strain measurements (Fig. 3e, f) on the surface 
of intact Langendorff-perfused rabbit hearts (Supplementary 
Videos 6-8). The coupling of cardiac excitation and contraction (Fig. 3a) 
may result in a mechanical rotor, which consists of two adjacent regions 
of dilating and contracting strain rates rotating around each other (see 
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(endo) and epicardial (epi) heart surface during ventricular fibrillation 
imaged using 4D ultrasound. Snapshots show interaction and transition 
of I-, U-, and O-shaped filaments. c, d, Numerical simulation of electrical 
(green, action potential (AP)) and mechanical (red—blue: strain rate with 
contracting (red) and dilating (blue) rates of deformation) scroll waves 

in the rabbit heart (computed tomography scan) and rectangular bulk 
with associated electrical (green) and mechanical (red) vortex filaments 
(Supplementary Video 3). NU, normalized units. 


also Extended Data Fig. 4a—c). Analogous to electrical rotors, mecha- 
nical rotors are characterized by a phase singularity, indicating the 
centre of rotation. In order to investigate the spatiotemporal organi- 
zation of electrical and mechanical rotors, data and statistical analyses 
were obtained in rabbit (n = 3) and pig hearts (n =3). During ventricu- 
lar fibrillation, the position of electrical and mechanical rotors were 
calculated and tracked over time. In total, 3,180 trajectories of phase 
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Figure 3 | Optical mapping of electromechanical phase singularities 
during ventricular fibrillation. a, Excitation—contraction coupling”® 

in the cardiomyocyte. Action potentials (AP, green) trigger the release 

of intracellular calcium (orange), resulting in cellular contraction (red). 

b, Numerical motion tracking in optical mapping data disentangles 
mechanical motion (deformation) from voltage and calcium signals. 

c, Efficacy of numerical motion tracking and motion-artefact removal. Top 
graph, without motion tracking, the time-series shows artefacts. Bottom 
graph, action potentials and calcium measured on the contracting heart 
surface after motion tracking. Top image sequence, the non-tracked action 
potential wave shows substantial motion artefacts (with a high frequency 
spatial pattern). Bottom image sequence, after motion tracking, a spiral 
wave (action potential, dark indicates depolarized tissue) is revealed 
(Supplementary Videos 4, 5 and Extended Data Fig. 4a). 

d, Simultaneous measurement of action potentials, calcium and 
deformation during ventricular fibrillation using optical mapping. 


singularities (electrical and mechanical) were measured in pig hearts, 
and 1,142 trajectories of phase singularities were measured in rabbit 
hearts. Along these trajectories, the positions of the phase singularities 
were sampled at 250 or 500 Hz. Representative trajectories of the elec- 
trical and mechanical phase singularities (x-y-t plots) are shown in 
Figs 3f, 4f (rabbit) and Fig. 4a—c (pig). From these trajectories, basic sta- 
tistical information has been retrieved (Figs 3g, 4d and Extended Data 
Fig. 6a-d). 
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Repetitive electrical activation leads to a corresponding sequence of 
dilating (blue indicates strain rates >0) and contracting (red indicates 
strain rates <0) rates of deformation. Top, raw unnormalized strain 
rate signal containing amplitude variations. Bottom, normalized strain 
rate used for computing of the phase signal (Extended Data Fig. 4b). 

e, Rotating electrical (voltage and calcium) and mechanical activity 
during ventricular fibrillation. f, Phase singularity space-time plots of 
action potentials, calcium and mechanics on the ventricular surface. 
Mechanical phase singularities co-localize with voltage and calcium 
phase singularities over multiple rotations, even for meandering rotors 
(1 pixel (px) = 0.16 mm, 100 pixels = approximately the diameter of the 
heart (16 mm)). g, Histogram of distances between phase singularities 
(voltage-to-calcium, voltage-to-mechanics, calcium-to-mechanics) 
during ventricular fibrillation shows co-localization with peaks at 
approxately 0.8 + 0.8 mm (average distance between phase singularities is 
4.0+0.8mm). AU, arbitrary units. 


Our experimental data show that across hearts and species (pig, rabbit) 
the observed electrical and mechanical phase singularities remain similar 
in number and lifetime (Fig. 4g, h). The data also support the hypothesis 
that, on average, electrical phase singularities are associated with nearby 
mechanical phase singularities, with the average nearest neighbour dis- 
tance between an electrical and its corresponding mechanical phase 
singularity being small, compared to the distance to the next electrical 
or mechanical phase singularity (Fig. 4a—e and Extended Data Table 1). 
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Figure 4 | Dynamics of electromechanical phase singularity during 
ventricular fibrillation. a—c, e, Co-localization of epicardial voltage 
(green) and mechanical (red) phase singularities on the pig heart 

surface at 160-ms (a), 110-ms (b), 120-ms (c) and 140-ms (e) intervals 

(1 pixel = 0.78 mm). d, Histograms of nearest neighbour distances 
between phase singularities (voltage-to-voltage, mechanics-to-mechanics 
and voltage-to-mechanics) during ventricular fibrillation (pig). f, Phase 
singularity trajectories within intervals of 40 ms and 80 ms during 
ventricular fibrillation (rabbit, n = 3). g, Temporal evolution of electrical 


On the cellular level, electrical and mechanical processes in the 
heart are related through excitation—contraction coupling and 
mechanoelectrical feedback. On the organ level, diffusive coupling 
of cardiomyocytes allows electrical impulses to propagate as an acti- 
vation wave, which leads to well-coordinated and relatively uniform 
mechanical contraction of cardiac tissue in the normal heart. In the 
diseased heart, excitation—contraction coupling and mechanoelectrical 
feedback may by altered, resulting in contractile dysfunction and 
onset of arrhythmias. Although electrical and mechanical processes 
are intrinsically coupled, the physical nature of the emergent electrical 
and mechanical wave phenomena is very different. Electrical waves 
in the heart are solitary excitation waves (and, for example, annihi- 
late under collision) with a typical conduction velocity of 1m s~!. By 
contrast, electromechanical waves”’ are elastic deformation waves, 
which are initiated by the excitation waves via excitation—contraction 
coupling. However, their wave-like propagation may be altered owing 
to subsequent interactions between the mechanically interconnected 
physiological structures of the heart. Passive elastic waves travelling 
through the heart also have the potential to effectively introduce 
non-local mechanical interactions because of the large speed of sound 


Lifetimes : AP 


LETTER 


c 

2.404 

2.354 

2.304 

100 50 100 
Pixels 
e 
4+ 2 pixels Top view 

? {4 


Mech, - 
@nP 100 _ 37 


100 50 


Midwall 
frequency spectra 


i Surface 
frequency spectra 


PS elec. Calcium 
PS mech. Mech. Mech 
ECG 05 ECG 
0 
10 20 0 10 20 
f (Hz) f (Hz) 


and mechanical phase singularity numbers (rabbit). f, g, A and B indicate 
intervals of low and high phase singularity density, respectively. 

h, Histogram of lifetimes of electrical and mechanical phase singularities 
(rabbit). i, Frequency spectra of voltage, calcium and strain rate on 

the rabbit heart surface obtained from optical mapping, mechanical 
deformation (strain rate) within the heart wall obtained from 2D 
ultrasound and electrocardiogram recordings. Dominant frequencies are 
9+ 1Hz (Extended Data Figs 4e, 9a-c). 


(approximately 1,500 m s~') in cardiac tissue. Although our results for 
ventricular fibrillation and ventricular tachycardia provide evidence for 
coupling of the voltage and calcium dynamics (Extended Data Fig. 5), 
coupled electromechanical dynamics and the coexistence of electri- 
cal and mechanical phase singularities, elastic patterns should not be 
considered as a simple proxy for the underlying electrical activation. 
Consequently, we expect that the mutual coupling of electrical and 
mechanical waves in a deformable excitable medium, such as the 
heart, gives rise to very rich spatiotemporal dynamics that still need 
to be explored (Supplementary Video 12, Extended Data Figs 7, 8 
and Supplementary Discussion). Despite their complex interactions, 
electrical and mechanical phase singularities show similar properties, 
including topological charge, structure, dynamics and lifetime, suggesting 
that the spatiotemporal dynamics of the electrical and mechanical 
system are essentially two sides of the same coin. 

We believe that the imaging approach introduced in this study will 
provide the basis for the integrative study of these electromechanical 
phenomena (such as excitation-contraction coupling and mechano- 
electrical feedback) at the tissue and organ level in health and disease. 
We expect that these developments will help to unravel the mechanisms 
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that underlie the spatiotemporal organization of electromechanical 
turbulence during cardiac fibrillation and could lead to safer and more 
effective diagnostic and therapeutic clinical applications. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Data reporting. No statistical methods were used to predetermine sample size. 
The experiments were not randomized and the investigators were not blinded to 
allocation during experiments and outcome assessment. 

Animal ethics. All procedures regarding care and use of animals were solely per- 
formed by appropriately trained personnel and were in accordance with German 
animal welfare laws and were reported to the local animal welfare representatives. 
Pre-experimental handling and humane welfare-oriented procedures strictly 
followed the regulations for animal welfare, in agreement with German legisla- 
tion, local stipulations and in accordance with recommendations of the Federation 
of European Laboratory Animal Science Associations (FELASA). All involved 
scientists and technicians have been accredited by the responsible ethics committee 
(the Lower Saxony State Office for Customer Protection and Food Safety—LAVES). 
Ex vivo electromechanical imaging. Experiments were conducted in isolated, 
intact Langendorff-perfused pig (n=5) and rabbit (n=5) hearts. Ventricular fibril- 
lation and ventricular tachycardia were induced using rapid pacing!*"’. Hearts 
were simultaneously imaged at high speeds using optical fluorescence (optical 
mapping) and ultrasound imaging. 

Optical fluorescence imaging. Fluorescence imaging was performed using either 
a panoramic imaging setup with 4 high-speed cameras (pig, together with 4D ultra- 
sound) imaging voltage only (mono-parametric) or a monocular single-camera 
setup (rabbit, together with 2D ultrasound) filming voltage and calcium simulta- 
neously (multi-parametric), as previously described”’. Imaging was performed at a 
frame rate of 500 Hz with a spatial resolution of 0.16-0.78 mm with 2-10-cm fields 
of view with high-speed EMCCD cameras (Evolve, 128 x 128 pixels, Photometrics) 
together with high-numerical-aperture lenses (pig: 9mm, f1-1.4, Fujinon; rabbit: 
52mm, f0.95-16, Tokina). Hearts were imaged from the side through the glass 
walls of an eight-sided aquarium at a distance of about 20.0-40.0 cm. During 
mono-parametric imaging of voltage, green excitation light was provided by up 
to 12 high-performance light-emitting diodes (LEDs, 532 nm + 10 nm centre 
wave length). During multi-parametric imaging of voltage and calcium, two sets 
of high-performance LEDs (three green and three red LEDs, LZ4-G40-100 and 
LZ4-R40-100, LED Engin; filtered at 550 + 5nm and 640 + 5 nm) were switched 
rapidly (500 Hz) using a custom-made fast-switching diode driver box (I.U. and 
EH.E, Georgia Institute of Technology), resulting in an effective frame rate of 
250 Hz for voltage and calcium recordings, respectively. Emission light was filtered 
using red/near-infrared longpass or bandpass filters (mono-parametric imaging, 
longpass filter with transmission >610 nm; multi-parametric imaging, dual- 
bandpass filter with transmission bandwidths at 585 nm and 800 nm, ET585/50nm 
and 800/200 nm; Chroma Technologies). In mono-parametric imaging mode, the 
tissue was stained with a voltage-sensitive dye (Di-4-ANEPPS, 20 ml of 1 mMol ii 
concentrated dye in Tyrode solution). In multi-parametric imaging mode, the 
tissue was stained with a voltage-sensitive dye (Di-4-ANBDQPQ, 40 il bolus injec- 
tion of 24.4mg ml! concentrated dye in ethanol) and a calcium-sensitive dye 
(Rhod2-AM, 500 il and 4.5 ml Tyrode solution over 5 min injected into the bubble 
trap). Pharmacological excitation-contraction uncoupling substances were not 
used. Cameras were triggered externally (20 MHz Function/Waveform Generator, 
33220A, Agilent) and trigger signals were captured using a data-acquisition 
system (MP150/AcqKnowledge, Biopac Systems Inc.). Movies were acquired using 
custom-made recording software (MultiRecorder, version 2.0.9, Research Group 
Biomedical Physics, Max Planck Institute for Dynamics and Self-Organization). 
Motion tracking, motion compensation and strain analysis in fluorescence 
video data. The heart’s motion was tracked using Lucas—-Kanade optical flow esti- 
mation algorithms*’’. The contrast in the video data was enhanced to facilitate 
motion tracking, as previously described”®. The tissue’s mechanical configuration 
was tracked in every pixel throughout the sequence of images with respect to one 
selected reference frame. The resulting time-varying, two-dimensional displace- 
ment vector fields were stored for further processing and erroneously tracked 
displacements were determined by computing the deviations of the vectors in a 
small neighbourhood of pixels (3-5 pixels diameter) and outliers were removed 
based on a threshold value (tr =5.0 accumulated length in pixels) or by removing 
individual vectors with lengths that exceeded a threshold value (tr = 4.0 pixels). 
The missing vectors (less than 5%) were substituted by a vector interpolated from 
the remaining surrounding displacement vectors (within 5-7 pixels diameter). 
The resulting time-varying vector field was smoothed in space and over time using 
spatiotemporal filter kernels with a Gaussian profile (diameter d= 3-5 pixels, 5-7 
frames linear averaging). To obtain the instantaneous displacements between 
two subsequent frames, each frame was subtracted from the previous frame. The 
resulting time-varying vector field of instantaneous displacements was smoothed 
in space and over time using spatiotemporal filter kernels with a Gaussian pro- 
file (radius r= 20-30 pixels, 5-11 frames linear averaging). From the smoothed 
vector field the gradient deformation tensors and the Green—Lagrangian strain 
tensors were computed. The data of the tensors were reduced to scalar-valued 
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data calculating tensor invariants, that is, the tensor trace, principal stretches or 
eigenvalues. The strain invariant (tensor trace) was normalized using sliding- 
window normalization as shown in Fig. 3d or Extended Data Fig. 4b, with a 
window width w of at least 1x the period of the dominant frequency and typically 
not more than 1.5-2.5 periods of the dominant frequency. The resulting time- 
varying strain rate map was smoothed in space and over time using spatiotemporal 
filter kernels with a Gaussian profile (d=5 pixels, 7 frames linear averaging). 
Warped, motion-stabilized videos were created using the evolution of the tracked 
tissue configurations to redistribute the intensity distribution of each pixel from 
the current frame along the inversely shifted direction to the tissue configuration 
of the reference frame. Pixel values were resampled into a regular pixel grid corre- 
sponding to the reference configuration using polygon clipping. All computational 
routines for extraction, processing and visualization of the fluorescence imaging 
data were custom-made routines written in C/C++ and OpenGL. 

Ultrasound imaging. Ultrasound imaging was performed either with a 4D 
ultrasound probe (4Z1c, matrix-array transducer, 2.8 Mhz centre frequency) and 
system (Acuson sc2000, Siemens AG) in pig hearts, or a 2D cross-sectional probe 
(MS-550D, linear array transducer, 22-55 MHz broadband frequency) and system 
(Vevo 2100, FUJIFILM Visualsonics Inc.) in rabbit hearts. 4D imaging refers to 
the acquisition of 3D volume images over time (3D + f). 2D imaging refers to the 
acquisition of 2D images over time (2D + t, B-mode). The 4D probe produces 
volume frames with a pyramidal shape (maximum opening angle 90 x 90 degrees, 
6-14cm lateral depth or pyramid height) and a relatively coarse spatial resolution 
(approximately 100 voxels*) but large fields of views (approximately 3 x 6 x 6cm? 
to 10 x 10 x 6cm?) at volume frame rates ranging from 51 volumes per second 
to 188 volumes per second depending on the lateral depth and opening angle 
of the pyramid (see Supplementary Table 1 for details). The spatial resolu- 
tion is approximately 0.5 mm? and decreases with the lateral distance from the 
transducer. The probe was positioned below an acoustic window (latex membrane) 
centred at the bottom of the tissue bath (Fig. 1c). A transducer mounting system 
(custom-made mounting arm based on dial indicator holder arm, Hoffmann 
Group) allowed arbitrary alignment of the imaging field of view (around several 
axes) and to secure the transducer in a stationary position once the desired imaging 
position and alignment was found. The field of view of the ultrasound imaging 
was facing upwards, covering the ventricles of the pig heart (Fig. 1b, c). The entire 
ventricular walls were imaged at a temporal resolution of 5.3-11.8ms (6-10 cm 
lateral field of view, 85-188 volumes per second). With a reduced field of view 
(that is, 36° x 90° to 50° x 90°), the entire left ventricular wall was imaged at a 
temporal resolution of 5.3-7.5 ms (134-188 volumes per second). The sampling 
distance during high-speed 4D imaging was reduced to enhance imaging speeds 
(T2 pre-setting from manufacturer). The 2D probe produces high-resolution 
(13 um in lateral and 271m in transversal direction) cross-sectional (B-mode) 
ultrasound speckle images. Imaging was performed at low line densities (pre- 
setting: 128 lines) at frame rates ranging from 279 to 309 frames per second with 
fields of views ranging in the order of approximately 1 cm x 1 cm and the frame 
rate varying with B-mode image widths and depths (see Supplementary Table 1 
for details). The probe was inserted into the bath from the top facing the bottom 
of the aquarium. Hearts were imaged with the 2D imaging plane embedded 
within the left ventricular wall, intersecting the wall tangentially approximately at 
midwall. A transducer mounting system allowed arbitrary alignment of the field 
of view (around several axes) and fixing the transducer in a stationary position. 
The imaging plane normally approximately coincided with the optical axis of the 
fluorescence imaging setup. Ultrasound TTL-trigger signals were captured using 
a data-acquisition system and software (MP150/AcqKnowledge, Biopac Systems 
Inc.), to be able to synchronize the start of the recording with other acquired data. 
Ultrasound data post-processing. 2D and 3D ultrasound movies were acquired 
in RF-format (proprietary formats from manufacturers, FUJIFILM Visualsonics 
Inc. and Siemens AG) and saved as 3D and 4D arrays (custom binary format, 
8-bit brightness mode, speckle intensities as integer values ranging from 0 to 
255). The arrays were then converted to 32-bit floating point data containing the 
normalized speckle intensities (ranging from 0 to 1), the intensity values being 
normalized by the maximal and minimal speckle intensity values in the entire 
array (or entire recording or part of the recording). 3D speckle images were 
resampled (custom-made python routine, using VTK-ProbeFilter resampling filter, 
VTK Visualization Toolkit, Kitware Inc.) from a pyramid-shaped image format 
into a Cartesian image format with a regular voxel grid structure with grid sizes 
in the order of 100 x 100 x 100 voxels. The height of the pyramid corresponds 
approximately to one of the lengths of the box. 2D speckle image sizes were in the 
order of 500 x 500 pixels and typically reduced in size to about 300 x 300 pixels. 
Motion tracking and strain analaysis in ultrasound data. Motion within the 
volumetric (3D, pig) or cross-sectional (2D, rabbit) ultrasound movie data was 
tracked using custom-made motion-tracking algorithms*!~* based on speckle 
correlation and block-matching. The same algorithm was used for the 2D and 3D 
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data. Only voxels and pixels showing the heart muscle were considered during 
the tracking procedure. The image data were segmented into two regions, one 
showing tissue and one showing the bath (pixel and voxel values: 0 = no tissue, 
1=tissue, Image]/Fiji segmentation editor and MATLAB). Ball- (3D) or disk- 
shaped (2D) blocks or interrogation windows (with diameters of 8-12 voxels and 
16-22 pixels) were used to determine shifts of the tissue (within a radius of r=4 
voxels and 10 pixels) in the image data and associate the positions of shifted tissue 
regions in between two frames. The displacements were determined by minimizing 
the mean quadratic difference (or absolute difference) of two blocks from two 
consecutive frames. Sub-pixel displacement accuracy was achieved by computing 
image moments (weight or centroid in 2D or 3D) in the resulting data containing 
the spatial distribution of mean quadratic differences for given (whole pixel) shifts. 
The instantaneous displacements between two subsequent frames were then 
tracked throughout the sequence of images. Despite the noisy speckle image data, 
tracking was found to detect a coherent displacement field indicating the shifts of 
the tissue robustly with few outliers. The resulting time-varying, two- or three- 
dimensional displacement vector fields containing the shifts of the tissue from one 
(volume) frame to the next were stored and erroneously tracked displacements 
were determined by computing the deviations of the vectors in a small (ball- or 
disk-shaped) neighbourhood of pixels (3-5 voxels or pixels diameter) around each 
voxel or pixel and outliers were removed based on a threshold value. The missing 
vectors were substituted by vectors interpolated from the remaining surrounding 
displacement vectors (within 5-7 voxel or pixel diameter). The resulting time- 
varying vector field of instantaneous displacements was smoothed in space and 
over time using spatiotemporal filter kernels with a Gaussian profile (radius 
r= 12-16 voxels and 60-80 pixels, 3-5 frames linear averaging). Scalar-valued 
measures of deformation (tensor invariants, that is, the tensor trace, principal 
stretches or principal eigenvalues, red—blue colour code with red indicating con- 
tractile and blue indicating tensile strain rates) were computed from the gradient 
deformation tensor and the Green—Lagrangian strain tensor, which were in turn 
computed from the smoothed displacement vector field. The strain invariant 
(tensor trace) was normalized using sliding-window normalization with a w of 
at least 1x the period of the dominant frequency and typically not more than 
1.5-2.5 periods of the dominant frequency. The resulting time-varying two- or 
three-dimensional strain rate map was smoothed in space and over time using 
spatiotemporal filter kernels with a Gaussian profile (d= 7-11 voxels or pixels, 3-5 
frames linear averaging). All computational routines for extraction, processing and 
visualization of the ultrasound imaging data were custom-made routines written 
in C/C++, VTK and OpenGL. 

Computation of phase singularities. From the amplitude of membrane potential, 
intracellular calcium, surface and intramural or volume strain, the phase was 
computed. The phase was computed similarly for both experimental and 
simulation data using the Hilbert transform. Phase singularities were obtained by 
computing the line integral’?! 


PV elr)ds = 2n(p — n) (1) 


where y(r) is the local phase at a location vector rand the integral is taken over the 
closed path s. p and n are the numbers of positively and negatively charged phase 
singularities enclosed within the path s. equation (1) was used previously for the 
characterization of vortex dynamics during fibrillation®!*!>"". The statistics of 
electrical and mechanical phase singularities were obtained from n=3 pig and 
n=3 rabbit hearts. Tracking of mechanical phase singularities enabled the recon- 
struction of the motion and evolution of filaments or phase singular points across 
space and time. Locations of phase singular points or lines within the moving and 
deforming medium were stored in the Lagrangian laboratory-coordinate frame. 

Computer simulations. Computer simulations were performed using realistic 
heart geometries as well as bulk-shaped geometries. Actively contracting 
cardiac tissue supporting nonlinear waves of electrical excitation was simulated 
using electromechanically coupled reaction-diffusion mechanics numerical 
models**°, The three-variable Fenton—Karma model? was used for simulations 
in realistic, heart-shaped geometries. The two-variable Aliev—Panfilov model?” 
was used for simulations in cubic-shaped domains. Parameters for the Fenton— 
Karma model were set to 2 = 5.6093115, 7, = 62.41, 7.5 = 63.4124, Ty) = 8.36, 
Toa = 10.8037, Ty = 40.4284, Typ = 36.5891, Toy = 663.715, Tp = 132.373, 
u-= 0.13, Uy, =0.05, ues = 0.45 with settings for scaling and diffusion proper- 
ties as previously described®. Parameters for the Aliev-Panfilov model were set 
to a=0.05, b=0.5, 1) =0.1, [2 =0.3, ¢=0.01, as previously described". Soft- 
tissue elasticity was simulated using a discrete particle system with controllable 
elastic anisotropy, as previously described?*°*. Bulk-shaped simulation domains 
consisted of 80 x 50 x 80 hexahedral cells. Realistic, heart-shaped domains were 
obtained from computerized tomography scans of the rabbit hearts used in the 


experiments by segmenting the voxel data and discretizing the segmented heart- 
shaped domains into tetrahedral volume elements. The simulation grid typically 
consisted of 120 x 120 x 120 simulation grid cells, which each consisted of up to 
four tetrahedral cells if containing the segmented elastic medium, less than four 
cells if containing boundaries of the medium, or no cells if situated outside the 
medium. Electromechanical coupling was realized by introducing a partial differ- 
ential equation to model excitation-induced active stress build-up, as previously 
described****"?, The active stress variable was used to modulate the spring con- 
stants of the particle system to model active contractions. However, only springs 
pointing along the local muscle fibre orientation were modulated, introducing an 
actively contracting preferred orientation in the elasticity, as well as local transverse 
isotropy of the passive elasticity>***. The active springs can point into arbitrary 
directions in the two- or three-dimensional space. 

Time-series and signal analysis. Frequency power spectra of electrocardiograms 
and cumulative frequency power spectra of the 2D and 3D experimental data 
(voltage, calcium, strain rate) and 2D and 3D simulation data (electrical activity, 
strain rate) were computed using the fast Fourier transform. First, the fast Fourier 
transform was applied to the entire time-series with typical durations of 2-10, 
with sampling rates of 85-300 Hz (ultrasound), 250-500 Hz (fluorescence imaging) 
and 2 kHz (electrocardiogram). Next, its absolute square was computed to yield the 
frequency power spectrum. For spatially extended data (ultrasound, fluorescence 
imaging and simulations), the calculation was carried out independently for each 
pixel or voxel, and the resulting spectra were then averaged across the whole spatial 
domain to yield the cumulative power spectrum. 

Cardiac preparation ex vivo. Géttingen minipigs (female, 21-26 months old, 
45 + 3.5kg) and New Zealand white rabbits (female, 6-12 months old, 2.5-3.5 kg) 
were anaesthesized (pig: pentobarbital; rabbit: trapanal) and euthanized via an 
intraveneous injection of potassium chloride (KCl). Immediately after euthanasia, 
cardioplegic solution was injected intravenously for temporary cessation of cardiac 
activity. Hearts were excised rapidly and inserted into cardioplegic solution for 
temporary cessation of cardiac activity. Isolated hearts were transferred quickly into 
a bath filled with warm, oxygenated (95% O2 and 5% COz) Tyrode (151 distilled 
water, 130 mMol sodium chloride (NaCl), 4mMol KCl, 0.6 mMol magnesium 
chloride hexahydrate (MgCl,(6H20)), 2.2 mMol calcium chloride (CaCl), 
1.2 mMol sodium dihydrogen-phosphate NaH,PO,(H20), 24.2 mMol sodium 
bicarbonate (NaHCOs3) and 12 mMol glucose, pH 7.36-7.40) and connected to 
a retrograde Langendorff-perfusion system (Hugo-Sachs Apparatus). The flow 
rates of the perfusate were 200 ml min“! and 30 ml min for pig or rabbit hearts, 
respectively, at a perfusion pressure of 50+ 5mm Hg. Tyrode was kept at a con- 
stant temperature of 37 + 0.5 °C (custom-made temperature control, Max Planck 
Institute for Dynamics and Self-Organization) and was constantly reperfused. 
Any mechanical pressure to the hearts was avoided to prevent compression of the 
coronary arteries. 

Arrhythmia induction and termination, data acquisition. Arrhythmias were 
induced using rapid pacing via bipolar electrodes or defibrillation paddles, and 
terminated using a single high-energy cardioversion shock via defibrillation 
paddles inserted into the bath, if they had not self-terminated, as previously 
described!*"5. Electrical power was supplied by a power amplifier (BOB 100-4M, 
Kepco Power Supply). Electrocardiograms were recorded using three thin 
electrodes inserted into the bath and acquired using a data-acquisition system 
(2.0kHz sampling rate, MP150, Biopac Systems Inc.). 

Computerized tomography scans. Micro-computed tomography scans of 
formalin-fixed rabbit hearts were performed using a commercial computerized 
tomography scanner (GE CT 120, GE Healthcare) operating at 1m spatial resolu- 
tions, as previously described!°. Hearts were fixed immediately after the imaging 
experiments. 

Code availability. All custom computer source code used in this study is available 
from the corresponding author upon reasonable request. 

Data availability. The datasets generated during and/or analysed during the 
current study are available from the corresponding author upon reasonable request. 
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Extended Data Figure 1 | Electromechanical vortex filament dynamics singularities corresponding to two counter-rotating spiral waves (see also 
during cardiac fibrillation. a, 3D volumetric ultrasound speckle image Fig. la). c, Breakthrough of an U-shaped intramural mechanical filament 


showing a contracting isolated pig heart during ventricular tachycardia (red) on the epicardial surface of a pig heart results in the formation of 
(same heart as shown in Fig. 1a, b, lateral imaging depth 6cm). Mechanical _ two transmural filaments during ventricular fibrillation and two phase 


filament (red) measured within ventricular wall (white region of interest; singularities (green circles) with opposite charge (counter rotation spirals) 
see also Fig. 1) between left and right ventricle. b, Validation of ultrasound _ on the surface (see also Fig. 2a). Optical mapping (voltage and strain rate) 
measurement of mechanical filaments on the epicardium using optical confirms filament dynamics observed with 4D ultrasound. 


mapping (voltage) confirms co-localized electrical and mechanical phase 
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Extended Data Figure 2 | See next page for caption. 
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Extended Data Figure 2 | 3D Imaging of electromechanical activity in 
the heart using simultaneous ultrasound and fluorescence imaging. 
a-f, Reconstruction of the heart shape during simultaneous imaging 

of electromechanical cardiac activity using ultrasound imaging and 
panoramic optical mapping. a, Photograph of heart in the Langendorff 
setup with ultrasound transducer at the bottom of the bath. LA, left 
atrium. b, 3D optical reconstruction of the surface of the heart obtained 
by rotating the heart after the experiment. Photorealistic rendering of 

the surface of the heart on reconstructed 3D surface mesh. c, Rendering 
(transparent) of 3D volumetric ultrasound speckle data showing the 
contracting heart from a similar perspective. Fan indicates lateral imaging 
depth of approximately 6-8 cm that was used for high-speed imaging. 

d, Optical mapping video images showing a bipolar electrode touching 
the epicardium of the left ventricle. The electrode is bend at the top, such 
that the tip is perpendicular to the electrode (only the tip touches the 
epicardium). The approximate location of the tip is indicated by the white 
rectangles in a-d, g, i. The 3D reconstruction allows the determination of 
the position of the electrode’s tip on the heart surface from the two video 
images. e, Optical calibration target used for optical 3D reconstruction of 


heart surface. f, Acoustic calibration target for ultrasound imaging used to 
determine approximate alignment and position of transducer. The inset 
shows the corresponding ultrasound speckle image of the calibration 
target at a particular depth. g-j, Focal electromechanical activity in a 
contracting pig heart mapped with optical mapping and ultrasound during 
pacing. g, Photorealistic rendering of optically reconstructed heart. The 
heart is rotated towards the left compared to a, b. h, Action potential wave 
propagating outwards (green arrows) from stimulation (voltage-sensitive 
mapping using Di-4-ANNEPS, black-white colour code: dark corresponds 
to depolarizing tissue or the upstroke of the action potential). i, 3D 
ultrasound speckle movie data showing same part of the heart (opaque and 
transparent rendering). j, Elastomechanical activation computed from 3D 
motion of the tissue acquired with ultrasound. The tissue starts to contract 
first at the stimulation site. The strain rate and its phase representation 
pattern exhibit a focal point that coincides with the stimulation site and 
the focal point that is visible in the optical maps. a-d, g, i, The region in 
which electrical pacing (stimulation) was applied to the heart is indicated 
with a rectangle. 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | Volumetric and cross-sectional ultrasound 
imaging and imaging configurations during ex vivo and in vivo 
imaging. a, Isolated pig heart in imaging setup with 4D volumetric 
ultrasound transducer scanning from beneath the heart through an 
acoustic window, together with simultaneous panoramic fluorescence 
imaging (Fig. 1). 4D ultrasound imaging (Acuson sc2000, Siemens AG) 
was performed using a two-dimensional matrix phased-array transducer 
(4Z1c) producing volume frames with a pyramidal shape (51-188 volumes 
per second, maximum opening angle 90 x 90 degrees, 6-14.cm lateral 
depth). The heart is aligned very similarly within the pyramid-shaped 
field of view as in an examination of a patient in vivo. b, Isolated rabbit 
heart in imaging setup with 2D cross-sectional ultrasound transducer 
scanning from above, together with simultaneous single-camera 
fluorescence imaging (see also d and Supplementary Video 9). c, Imaging 
configuration during 4D ultrasound imaging and panoramic multi-camera 
optical mapping with four cameras (see also a). The optical axis of the 
cameras and the lateral axis of the ultrasound scanner are approximately 
perpendicular. d, Imaging configuration during 2D ultrasound imaging 
and single-camera optical mapping (see also b). The optical axis of the 
camera and the lateral axis of the ultrasound scanner are approximately 
perpendicular. The cross-sectional ultrasound imaging plane is positioned 


and aligned tangentially within the ventricular wall underneath and 
co-planar to the imaged surface. e, Imaging configuration during 3D 
ultrasound imaging in vivo. In ultrasound examinations of patients, the 
human heart is either imaged from outside the body through the chest 

in a transthoracic echocardiography (TTE) examination, or from within 
the body in a transoesophageal echocardiography examination using a 
transoesophageal ultrasound probe. The first measurement configuration 
provides a clearer picture of the ventricles, whereas the latter measurement 
provides a clearer picture of the atria. We aimed to mimic the TTE 
measurement imaging the ventricles using a probe that is routinely used 
in TTE examinations. In a TTE measurement, the transducer may be 
positioned on the chest below the heart imaging upwards through two ribs 
or from underneath the ribs. In one of the various views, the apex of the 
heart is facing the transducer (apical view), as it is located closest to the 
transducer array. The atria are located furthest away from the transducer 
array. The imaging configuration used in our experiment is very similar 
to this situation (a). The size of the human heart is comparable to the size 
of the pig hearts used in the ex vivo experiments. f, Frequent imaging 
configuration during 3D ultrasound imaging with panoramic optical 
mapping. One camera always imaged the anterior left ventricular wall 
(similar perspective as in a). 
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Extended Data Figure 4 | Mapping and signal analysis of 
electromechanical rotor patterns during ventricular fibrillation and 
tachycardia in isolated Langendorff-perfused hearts. a, Mapping of 
epicardial electrical (voltage) rotor activity during ventricular fibrillation 
on the contracting surface of a rabbit heart. Left, the time-series (black) 
obtained from the unstabilized fluorescence maps shows substantial 
motion artefacts, whereas the time-series of the motion-stabilized maps 
(green) shows a series of action potentials. Right, from the stabilized 
time-series, it is possible to compute the phase (black), that is, introducing 
phase jumps at the upstroke of the action potentials. Bottom, the upper 
image series shows a counter-clockwise-rotating action potential rotor 
(green) on the contracting heart surface (Supplementary Video 6). The 
lower image series shows the corresponding phase representation of the 
rotor and a phase singularity at the tip of the spiral. b, Left, time-series 
of strain rate obtained from the displacement field. Middle, normalized 
strain rate (see Methods). Right, zero-crossings of strain rate are used 
to obtain phase representation. Bottom, rotating wave of dilating (blue) 
and contracting (red) tissue regions rotating around each other and 
corresponding phase representation. A circle indicates the position 
of the mechanical phase singularity. Electrical and mechanical 
phase singularities are co-localized (Supplementary Video 6). 
c, Electrocardiogram during ventricular fibrillation (same heart as shown 
ina, b). Frequency spectra of electrocardiogram (black) and electrical 
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(voltage, green) and elastomechanical (red) patterns measured optically on 
the epicardial surface of the left ventricular wall (a, b), showing matching 
dominant frequencies (9 + 1 Hz). Frequency spectra of the patterns are 
cumulative spectra of all measurement sites (Supplementary Video 6). 

d, Electromechanical activity in isolated pig heart during ventricular 
tachycardia (Fig. 1 and Extended Data Fig. 1b). Top, time-series of action 
potential activity measured optically using voltage-sensitive optical 
mapping on anterior left ventricular surface. Middle, time-series of 
strain rate measured optically in same location as in the top panel on left 
ventricular surface using fluorescence imaging. Bottom, time-series of 
strain rate measured within the anterior left ventricular wall using 4D 
ultrasound imaging. n.u., normalized units. e, Electrocardiogram acquired 
in the isolated rabbit heart during ventricular fibrillation (Fig. 4i). The 
arrhythmic episode ended by self-termination of arrhythmic activity and 
transition into sinus rhythm. Frequency spectra of electrocardiogram 
and electrical (voltage, green) and elastomechanical (red) patterns 
measured optically on the epicardial surface and elastomechanical (red) 
patterns measured using ultrasound at midwall inside the left ventricular 
wall during arrhythmic episode (including data within 2.0 s before self- 
termination). The dominant frequencies are 9 + 1Hz for voltage, calcium 
and strain. Frequency spectra of the patterns are cumulative spectra of all 
measurement sites. 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | Coupled voltage and calcium dynamics 
mapped during ventricular fibrillation. Voltage (action potential, 
green, voltage-sensitive staining using Di4- ANBDQPQ) and calcium 
(intracellular calcium, orange, calcium-sensitive staining using 
Rhod2-AM) vortex wave activity mapped on ventricular surface of the 
isolated rabbit heart during ventricular fibrillation using dual-imaging”. 
Recordings were acquired in non-contracting hearts after administering 
of blebbistatin (4 ml diluted in 700 ml Tyrode, retrograde perfusion 

with constant reperfusion). a, b, Example time-series (n = 5) of voltage— 
calcium activity chosen from arbitrary sites on the surface. Time-series 
show closely coupled voltage-calcium wave dynamics with a dominant 
frequency of 9 + 1 Hz and a delay of the calcium activity. a, Activity on 
the surface within 2s long (left) and 1 s long intervals (right) to facilitate 
viewing of the upstrokes and the delay between voltage and calcium. 
Recordings acquired before administering (a) and after the addition of 
cromakalim (b, 10j:mol in 11] Tyrode, retrograde perfusion with constant 
reperfusion). We did not observe a noticeable difference in the dynamics. 
c, Voltage (action potential, green, voltage-sensitive staining using Di4- 


ANBDQPQ) and calcium (intracellular calcium, orange, calcium-sensitive 
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staining using Rhod2-AM) raw movies (black-and-white colour code) 
mapped on ventricular surface (field of view approximately 1.5 x 1.5cm’, 
time in ms) during ventricular fibrillation. The voltage-sensitive data 

are inverted (dimensionless normalized units (n.u.)), such that bright 

(or green) areas show depolarized tissue. Bright (or orange) areas in the 
calcium-sensitive data (dimensionless normalized units (n.u.)), show 
high intracellular calcium concentrations. The calcium-sensitive images 
show the same part of the tissue that is shown in the voltage-sensitive 
images, however, with a delay of 10 ms to compensate for the delay 
between voltage and calcium and to facilitate analysis of similar wave 
patterns. The patterns are highly similar and evolve congruently across 
the surface during ventricular fibrillation. Example image series chosen 
from a recording with a duration of 20s. The findings are the same across 
different recordings. The experiment was repeated twice in n =2 hearts. 
However, other studies*!““* have also shown that in abnormal voltage and 
calcium regimes the dynamics can have complex interactions and are not 
always necessarily closely coupled as shown here (see also Supplementary 
Discussion). 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | Characteristics of epicardial electrical and 
mechanical phase singularities during ventricular fibrillation in pig 
and rabbit hearts. a, b, Mean number of electrical (green) and mechanical 
(red) phase singularities (PS) during ventricular fibrillation imaged on 
the epicardial ventricular surface of isolated Langendorff-perfused pig 
hearts (a; four measurements from n = 3 hearts) and rabbit hearts 

(b; three measurements from n = 3 hearts). The number of phase 
singularities fluctuates strongly over time (error bars indicate the standard 
deviation; see also Fig. 4g). Both the electrical and mechanical average 
numbers of phase singularities similarly reflect different regimes of 
ventricular fibrillation (b). We consistently observed a slightly larger 
number of mechanical phase singularities (factor 1.2 + 0.1). The average 
numbers of phase singularities were computed from 10-s long (500 frames 
per second) or 20-s long (250 frames per second) recordings with 5,000 
video images, including > 10,000 measurements of phase singularities, 

the recordings were more than 100 times longer than the average period 
or lifetime of a rotor. c, Ratio of the mean number of electrical and 
mechanical phase singularities (ratio = number mechanical PS/number 
electrical PS) during ventricular fibrillation on epicardial ventricular 
surface of isolated Langendorff-perfused pig hearts (four measurements 
from n= 3 hearts) and rabbit hearts (three measurements from n= 3 
hearts). The ratio is close to 1 (1.2 £0.1) and consistently larger than 1, 
indicating that more mechanical than electrical phase singularities appear 
on the surface during ventricular fibrillation. Error bars are large as the 
number of phase singularities fluctuates strongly (from 0 to approximately 
10) over long times (Fig. 4g). Error bars were computed as the standard 
deviation of the fluctuations of number of phase singularities over time 
(>1,000 samples or time-steps). The ratio (or centre of the plot) was 
computed from the simple average of the number of phase singularities 
(>1,000 samples or time-steps). d, Co-localization factor sigma indicating 
the precision, with which a mechanical phase singularity describes on 
average the position of a nearby electrical phase singularity. For large 
phase singularity numbers (nps = 6-10; right) the precision is about 
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0.2 or 1/5 of the average rotor distance (of 1). In this regime, two phase 
singularities can clearly be separated from each other (Fig. 4a). For the 
computation of sigma, the average rotor distances and co-localization 
distances from the distributions as shown in Fig. 3g and Fig. 4a were used. 
For smaller phase singularity numbers (nps = 1-3; left) sigma increases, 
indicating that the precision with which a mechanical phase singularity 
predicts the location of an electrical phase singularity decreases. The two 
data points (on the left) describe dynamical regimes with few strongly 
meandering rotors with linear cores and larger overall deformations of 
the cardiac muscle (also data point 5 in c) during ventricular fibrillation 
in rabbit hearts (Fig. 3). However, lower sigmas were observed in both pig 
and rabbit hearts. Lower sigmas or a higher precision is obtained during 
ventricular fibrillation with a larger number of smaller rotors and a weaker 
overall deformation of the cardiac muscle. e, Trajectories of mechanical 
phase singularities during ventricular fibrillation on the surface of 

the rabbit heart. Mechanical phase singularities (red dots) computed 
individually from voltage-sensitive (top) and calcium-sensitive (bottom) 
imaging data during multimodal fluorescence imaging (voltage, calcium 
and strain) with interleaved acquisition of the two channels (250 frames 
per second per channel = 500 frames per second). The plots show the 
accumulated mechanical phase singularities during a 400-ms long time 
interval at 2s (left), 4s (centre) and 8s (right) of a 20-s long recording. 
The positions and trajectories of the mechanical phase singularities 
computed from the voltage or the calcium data are identical or at least 
almost identical throughout time. The corresponding strain rate patterns, 
from which the phase singularities were computed, are also almost 
identical or highly similar in each frame over time. The positions and 
trajectories of the mechanical phase singularities and strain rate patterns 
are also identical or at least almost identical when only one dye is used 
and one of the channels does not contain a fluorescent signal. The data 
demonstrate that the optically derived strain rate patterns are robust, that 
is, independent of fluorescence-induced image intensity fluctuations. 
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Extended Data Figure 7 | See next page for caption. 
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Extended Data Figure 7 | Electromechanical dissociation and loss 

of contractility during very rapid (20 Hz) ventricular fibrillation 

in pig hearts. a, Example traces of action potential wave activity 
measured on the left ventricular epicardial surface of a pig heart during 
ventricular fibrillation using multimodal optical mapping (voltage and 
contraction). Before the addition of the Karp channel opener cromakalim 
(baseline ventricular fibrillation, top), the ventricular muscle exhibits 
both fibrillatory electrical as well as contractile activity and associated 
deformations (see also Supplementary Video 12). Owing to the motion 
of the tissue, the optical traces are superimposed by motion artefacts, 
visible as modulations of the baseline of the signal. Motion stabilization 
and motion artefact removal retrieves the action potential wave activity 
(middle) with substantially reduced motion artefacts. The associated 
optical maps show spiral vortex wave patterns on the surface of the 

heart (see also Supplementary Video 12). The dominant frequency of 

the basal fibrillatory activity is 10.6 + 1.5 Hz (b). After the addition of 
cromakalim, the dominant frequency of the electrical activity increases 
markedly (compared to ref. 40) to 20.5 + 0.5 Hz, see b. At the same time, 
we measured that the heart does not exhibit any visible motion any longer 
(Supplementary Video 12). Because the contractions of the cardiac muscle 
vanish during very rapid ventricular fibrillation, optical maps and traces 
of the action potential wave activity can be retrieved without numerical 
motion tracking and motion stabilization (bottom). The modulations of 
the baseline of the signals, which can be seen during baseline ventricular 
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fibrillation (10 Hz), do not appear at very rapid ventricular fibrillation 
(20 Hz), indicating that motion is not present (Supplementary Video 

12). The data demonstrate a decoupling of the electromechanical 

wave dynamics during very rapid ventricular fibrillation induced by 
cromakalim. b, Loss of contractile activity during very rapid (20 Hz) 
ventricular fibrillation. Frequency spectra of electrical action potential 
wave pattern (green) and mechanical activity (red) measured during 
baseline ventricular fibrillation (top left, maxima at approximately 10 Hz) 
and very rapid ventricular fibrillation (bottom left, 20 Hz peak). The 
dominant frequency of the action potential wave activity approximately 
doubles from 10 Hz to 20 Hz after the addition of cromakalim, as 
previously described*’. At the same time, the contractile motion of the 
heart vanishes. Correspondingly, a frequency spectrum of the contractile 
or mechanical activity could not be computed during very rapid 
ventricular fibrillation. The histograms (middle) show large amplitudes 
of motion (1-3 pixels displacements of tissue segments within short time 
spans of 2-3 wave periods) during baseline ventricular fibrillation (top 
middle) and substantially reduced or vanishing amplitudes of motion 
(0-0.3 pixels displacements, Supplementary Video 12) during very rapid 
ventricular fibrillation (bottom middle). The amplitudes of motion were 
calculated from the displacement data obtained from the motion-tracking 
procedure. The corresponding maps (top right and bottom right) show 
the amplitudes of motion measured across the ventricular surface (yellow 
indicates 2-3 pixels displacement; blue indicates 0 pixels displacement). 
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Extended Data Figure 8 | See next page for caption. 
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Extended Data Figure 8 | Anisotropy of electromechanical rotor 
patterns in elastic excitable media (computer simulation) and effect of 
mechanical inhomogeneity onto mechanical phase singularities. 

a, b, d, Dependence of local strain rate amplitude and strain rate 


morphology on the direction of wave propagation relative to muscle fibres. 


The electrical spiral wave pattern (a) and corresponding elastomechanical 
strain rate patterns (b, d) form in two identically prepared simulation 
domains with differing underlying vertical (b) and horizontal (d) 

muscle fibre anisotropy (uniform transverse linear fibre orientation). 

c, The contractions along the fibre orientation produce stronger strain 
rate amplitudes when the wave propagates along the fibre orientation 
(green time-series in b, sampled from the green triangle) and weaker 
strain rate amplitudes when the wave propagates perpendicular to the 
fibre orientation (orange time-series in b, sampled from the orange 
triangle). Overall, the morphology of the strain rate pattern aligns with 
the fibre orientation and exhibits stronger gradients between dilating 
and contracting rates of deformation along the fibre orientation than 
perpendicular to it (polarization). The deformation data were obtained 
in quasi-2D electromechanical computer simulations with homogeneous 
active tension development and immediate electromechanical 

coupling and simulation domains with finite thickness (see Methods). 
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e-h, Sensitivity of phase singularity detection to external perturbations 
depending on the direction of wave propagation relative to fibres. 

e, Strain rate pattern superimposed with electrical (blue circle) and 
elastomechanical phase singularities (red circles) in a domain with 
uniform vertical fibre anisotropy. Perturbations caused by boundary 
conditions can distort the phase pattern and lead to the detection of 
spurious mechanical phase singularities, preferentially along the direction 
that is perpendicular to the fibre orientation. The spurious mechanical 
phase singularities propagate outwards away from the spiral wave core. 

f, Addition of a weak global strain perturbation exacerbates this effect 

in the direction that is perpendicular to the fibre orientation. g, Phase of 
strain rate shown in e used for the computation of the phase singularities 
also reveals anisotropy and polarization. h, The strong strain rate signal 
parallel to the fibre orientation (green time-series) is largely unaffected 
by a small perturbation, whereas, for the same perturbation amplitude, 
the strain rate signal in the low-amplitude region perpendicular to the 
fibre orientation is distorted more strongly (orange time-series). The 
deformation was obtained in simulations identical to those shown in a-d. 
Phase singularities from both the electrical signal and the strain rate were 
calculated using Hilbert transforms. 
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Extended Data Figure 9 | See next page for caption. 
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Extended Data Figure 9 | Measurement of intramural phase singularity 
dynamics using 2D ultrasound imaging and tracking of a non-moving 
heart versus a fibrillating contracting heart in 3D ultrasound data and 
optical mapping. a, Schematic of the experimental setup for simultaneous 
fluorescence and 2D ultrasound imaging in intact, Langendorff-perfused 
rabbit hearts. Left, ultrasound imaging (Vevo 2100, VisualSonics Inc.) 
with the cross-sectional echocardiographic imaging plane aligned 
tangentially inside the left ventricular wall (white line). Imaging plane 
facing fluorescence imaging camera (Extended Data Fig. 3d). Right, action 
potential, calcium and strain imaging setup for fluorescence imaging. 

b, Schematic of imaging configuration with action potential wave (phase) 
imaged on the surface of the heart and rotating mechanical pattern 
(phase) imaged inside the ultrasound cross-section inside the ventricular 
wall. The rendering shows a scroll wave (green, computer simulation). 

c, Optical mapping shows a counter-clockwise rotating action potential 
wave (AP, green) on the surface of a rabbit heart during ventricular 
fibrillation (Supplementary Video 9). The electrical vortex is associated 
with a counter-clockwise rotating pattern of contractile (red) and tensile 
(blue) rates of strain observed optically with fluorescence imaging on the 
surface. The mechanical deformation observed in B-mode ultrasound 
imaging in an imaging plane located beneath the imaged surface and 
aligned approximately parallel or co-planar to the epicardium shows a 
phase singularity, corresponding to a counter-clockwise rotating wave in 
mechanical deformation (Supplementary Video 9). The analysis of the 
electrical and mechanical vortices reveals co-existing phase singularities 
(indicated by white circles) on the surface and inside the ventricular wall. 
d, Tracking of the non-moving heart compared to a fibrillating contracting 
heart in 3D ultrasound and fluorescence imaging data. Distributions 
showing magnitudes of tracked displacement vectors resulting from 
motion tracking of the 3D ultrasound data (blue and red curves) and 
motion tracking during optical imaging (yellow curve). Influences by 
speckle noise or possible residual motion onto the motion tracking 

and motion analysis during ultrasound imaging are minimal in the 
Langendorff setup. Motion is not visible (Supplementary Video 13, left) 
when the heart does not contract. In comparison, the small contractions 
and deformations during ventricular fibrillation can clearly be observed 
(Supplementary Video 13, right) and can also be detected using motion- 
tracking algorithms (red curve, 3D ultrasound; yellow curve, optical 
mapping). The maximum of tracked displacements for the asystolic heart 
is approximately 0.15 mm. By contrast, during fibrillation, the maximum 
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displacements are clearly shifted towards larger values with a maximum at 
0.45 mm. The magnitudes of the tracked displacements during ventricular 
fibrillation (red curve) are confirmed by the optical measurement 
performed at the same time (yellow curve). Statistical analysis (two-sample 
Kolmogorov—Smirnov test) rejects the null hypothesis that the ultrasound- 
based measurements of tissue displacement during asystole and 
ventricular fibrillation are from the same continuous distribution at a 1% 
significance level. For the ultrasound data, the distributions were obtained 
from 274,444 voxels (asystole) and 165,558 voxels (fibrillation), for which 
the displacements were tracked in between consecutive volume frames 
(with At= 1 or At=2 or At=3) for the entire video sequence. Then only 
the maximal displacement in each voxel was stored and considered for the 
distributions to emphasize the influence of noise. Note in this context that 
in the asystolic case, tracking yields small displacement magnitudes due 

to measurement noise resulting presumably from both the algorithm and 
speckle noise. For a better comparison, the At or the temporal distance 
between volume frames between which motion was detected was adjusted. 
The asystolic data were imaged at volume rates of 62 volumes per second, 
whereas the fibrillation data was imaged at volume rates of 91 volumes 

per second. For the asystolic data (left), displacements were computed in 
between one and the second next volume frame (At= 2), resulting in an 
effective volume rate of approximately 30 volumes per second. However, 
the distribution (left) remained unchanged with At= 1, At=2 or At=3 
frames, indicating that the tissue is static and does not exhibit motion. For 
the fibrillation data (right), displacements were computed in between one 
and the third next volume frame (At= 3), resulting also in an effective 
volume rate of approximately 30 volumes per second. With At=1 or 
At=2 frames, the distribution also remains clearly distinguishable from 
the non-moving distribution and retains displacement magnitudes well 
over 0.5 voxels. Note that in this particular fibrillation dataset, the overall 
strength of motion and deformation is relatively small compared to 

other datasets (for instance the one shown in Fig. 1). In the optical data, 
displacements were computed from a frame at time f to a frame at time 

t+ 32 ms (At= 16 frames) to achieve a corresponding measurement of 
tissue displacements at imaging speeds of approximately 30 frames per 
second and to ensure comparability of the data. The data and the clearly 
distinguishable visual appearance of the non-moving and fibrillating heart 
(Supplementary Video 13) demonstrate that the spatiotemporal resolution 
of the 3D ultrasound imaging is sufficient to resolve elastomechanical 
deformation patterns during tachyarrhythmias. 
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Extended Data Table 1 | Statistics of electrical and mechanical phase singularity dynamics on the surface of fibrillating, contracting hearts 
(n=6) 


#PS #PS Ratio | Distance Dist. Dist. Ratio 
Vv M M/V | V-M[mm] | PSV [mm] | PSM[mm] | Distance 
V-M 


Pig 
75 +35 


3 | 70227 | 72229 | 10 | 43243 | 25243 | 215243 | 0.20 | 
4 | 34236 | 91e34 | 11 | 34217 | 26269 | 206269 | 0.17 
Rabbit 


Rabbit 


5 |14e14/ 28218 | 20 | 29212 | s9e12 | 70212 | 050 __ 
| 6 | 66236 | aasaa | 13 | ootos | 53¢12 | 59412 | 0.17 | 


Measurements obtained by fluorescence imaging of the left ventricular surface of fibrillating, contracting pig (n =3) and rabbit (n= 3) hearts. The fourth dataset (in the pig) was obtained in the same 
heart as the third heart, but in a different location (posterior instead of anterior left ventricular wall). Each data point corresponds to a recording, in which many hundreds of measurements were 
performed throughout the sequence of video images. The table shows similar numbers of electrical (V) and mechanical (M) phase singularities across hearts in the same species and similar ratios 
(M/V) across hearts and species, respectively. The measurements were obtained from several thousand video frames. In addition, the distances between electrical and mechanical (V-M) phase singu- 
larities are smaller than between equal (electrical-electrical or mechanical-mechanical) phase singularities across hearts and species, respectively. The data support the hypothesis that, on average, 
each electrical phase singularity can be associated with a nearby mechanical phase singularity. The average nearest neighbour distance between an electrical phase singularity and its corresponding 
mechanical phase singularity is small compared to or a fraction of the distance to the next electrical or mechanical phase singularity, respectively (in almost all cases the fraction is about 0.2). 
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Hepatocyte-secreted DPP4 in obesity promotes 
adipose inflammation and insulin resistance 


Devram S. Ghorpade!, Lale Ozcan!, Ze Zheng!, Sarah M. Nicoloro?, Yuefei Shen’, Emily Chen**, Matthias Blither', 


Michael P. Czech? & Ira Tabas»® 


Obesity-induced metabolic disease involves functional integration 
among several organs via circulating factors, but little is known 
about crosstalk between liver and visceral adipose tissue (VAT)!. In 
obesity, VAT becomes populated with inflammatory adipose tissue 
macrophages (ATMs). In obese humans, there is a close correlation 
between adipose tissue inflammation and insulin resistance*”, and 
in obese mice, blocking systemic or ATM inflammation improves 
insulin sensitivity® §. However, processes that promote pathological 
adipose tissue inflammation in obesity are incompletely understood. 
Here we show that obesity in mice stimulates hepatocytes to 
synthesize and secrete dipeptidyl peptidase 4 (DPP4), which acts 
with plasma factor Xa to inflame ATMs. Silencing expression of 
DPP4 in hepatocytes suppresses inflammation of VAT and insulin 
resistance; however, a similar effect is not seen with the orally 
administered DPP4 inhibitor sitagliptin. Inflammation and insulin 
resistance are also suppressed by silencing expression of caveolin-1 
or PAR2 in ATMs; these proteins mediate the actions of DPP4 and 
factor Xa, respectively. Thus, hepatocyte DPP4 promotes VAT 
inflammation and insulin resistance in obesity, and targeting this 
pathway may have metabolic benefits that are distinct from those 
observed with oral DPP4 inhibitors. 

In obese mice and humans, a pathway in hepatocytes involving 
Ca**-calmodulin-dependent protein kinase II (CaMKII) contributes to 
both excessive hepatic glucose production and impaired hepatic insulin 
signalling”"''. Inhibition of this pathway by hepatocyte-specific dele- 
tion of CaMKIIy in diet-induced obese (DIO) mice (Camk2g"" mice 
infected with AAV8-TBG-cre, hereafter referred to as H-CaMKII(KO) 
mice, improves glucose and insulin tolerance?!®, Notably, the VAT 
of these mice had fewer ATMs in crown-like structures (CLS) and 
decreased Adgre1 mRNA (which encodes the F4/80 glycoprotein) in 
comparison to VAT of control Camk2g"" DIO mice (Extended Data 
Fig. 1a). There was also decreased infiltration of inflammatory Ly6C™ 
monocytes and lower inflammatory cytokine expression in the VAT 
of H-CaMKII(KO) mice, with no significant change in blood mono- 
cyte number, plasma IL6 or TNFa, or liver inflammation (Extended 
Data Fig. 1b-e). ATF4 expression is regulated downstream of CaMKII 
signalling and is decreased in the hepatocytes of H-CaMKII(KO) DIO 
mice’”. Restoring ATF4 expression in the livers of mice deficient in 
hepatocyte CaMKII using an adenovirus vector restored VAT inflam- 
mation without affecting liver inflammation (Extended Data Fig. If, g). 
Conversely, hepatocyte-specific deletion of ATF4 in DIO mice lowered 
VAT inflammation (Extended Data Fig. 1h). Therefore, we hypothe- 
sized that, in obesity, activation of CaMKII and ATF4 in hepatocytes 
induces the secretion of a circulatory factor (a ‘hepatokine’) that pro- 
motes VAT inflammation. 

To test this hypothesis, we developed an ex vivo assay in which cells 
from the stromal vascular fraction (SVF) of VAT are incubated with 
plasma from the above mouse models, and Mcp1 (also known as Ccl2) 


and I/6 mRNA are quantified as markers of inflammation. Because 
VAT macrophages are a likely target of the putative hepatokine, we 
predicted that cells from the SVE of VAT from obese mice, which har- 
bour more of these macrophages** (Extended Data Fig. 2a), would 
show an increased response in this assay in comparison to SVF cells 
from lean mice. Plasma from obese mice would also be expected to 
evoke a more potent response in the assay than plasma from lean mice. 
Consistent with this hypothesis, plasma from DIO mice induced higher 
expression of both Mcp1 and I/6 mRNA than plasma from lean mice 
in SVE cells from DIO mice, but did not have this effect on SVF cells 
from lean mice. Furthermore, the inflammatory activity of plasma 
from DIO mice was due to the macrophage component of SVF cells 
(Fig. la and Extended Data Fig. 2b). DIO mouse plasma was also able 
to induce Mcp1 expression in peritoneal and bone marrow-derived 
macrophages (BMDMs; Extended Data Fig. 2c). Most importantly, 
plasma from H-CaMKII-(KO) DIO mice was less able to induce Mcp 1 
expression in SVF than plasma from either wild-type DIO mice or 
H-CaMKII(KO) mice in which hepatic ATF4 expression was restored 
(Fig. 1b). These data suggest that, in obesity, activation of hepatocyte 
CaMKII and ATF4 induce one or more secretory factors that promote 
VAT-macrophage inflammation. 

The Mcp 1-inducing factor in DIO mouse plasma was heat labile 
(Extended Data Fig. 2d), suggesting that it may be a protein. We per- 
formed size-exclusion gel-filtration fast protein liquid chromatography 
(FPLC) with DIO mouse plasma (Extended Data Fig. 2e) and tested 
eluted fractions for their ability to induce Mcp1 expression in SVF cells. 
There was a peak of activity in three fractions in the 125-200 kDa range 
(Extended Data Fig. 2f). The active fraction F44 and inactive fractions 
F42 and F46 were analysed by liquid chromatography-tandem mass 
spectrometry (LC-MS/MS) to identify peptides that were more abun- 
dant in F44 than in F42 and F46 (Supplementary Table 2a). Peptides 
corresponding to DPP4 (CD26) met these criteria (Extended Data 
Fig. 2g). DPP4 is a dipeptidyl protease that can exist as either a cell 
membrane protein or a soluble plasma protein’*. DPP4 concentration 
correlates with body mass index and insulin resistance in humans!*!4, 
and we found that DPP4 activity was higher in the plasma of DIO mice 
than in that of lean mice (Extended Data Fig. 2h). 

Consistent with a role for DPP4, the Mcp1- and Il6-inducing activity 
of DIO mouse plasma could be suppressed by an inhibitor of DPP4 
(Fig. 1c and Extended Data Fig. 2i). Furthermore, DPP4 activity in 
plasma and Dpp4 mRNA expression in liver, but not in VAT, correlated 
exactly with VAT inflammation in vivo and plasma SVF-inflammatory 
activity ex vivo in wild-type and H-CaMKII(KO) mice as well as in 
H-CaMKII(KO) mice in which hepatic ATF4 expression was restored 
(Fig. 1d and Extended Data Fig. 2j). Moreover, restoration of hepatic 
DPP4 expression in H-CaMKII(KO) mice restored DPP4 activity 
in plasma and VAT inflammation without changing body weight 
(Extended Data Fig. 3a, b). Consistent with the role of ATF4, we 
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Figure 1 | Hepatocyte-specific DPP4 silencing suppresses VAT 
inflammation and improves insulin sensitivity in DIO mice. a, VAT SVF 
cells from DIO mice, and ATMs and non-ATMs isolated from SVF, were 
incubated for 4 h with medium containing 10% (vol/vol) plasma from lean 
or DIO mice and then assayed for Mcp1 or Il6 mRNA (n= 4-6 mice per 
group; *P < 0.05 by two-tailed Student's t-test). b, VAT SVF cells from DIO 
mice were incubated with plasma from the three DIO models described 

in Extended Data Fig. 1f, g and then assayed for Mcp1 mRNA. WT, wild 
type. n= 8 mice per group; mean + s.e.m.; *P < 0.05 by one-way analysis 
of variance (ANOVA). c, SVE cells from DIO mice were incubated with 
lean or DIO mouse plasma that was pre-treated for 1 h with or without 
10,.M DPP4 inhibitor KR62436 and then assayed for Mcp1 mRNA (n=3 


found an ATF4-consensus site in exon 1 of the Dpp4 gene. Chromatin 
immunoprecipitation (ChIP) analysis revealed lower levels of ATF4 
occupancy at this site in the livers of H-CaMKII(KO) DIO mice versus 
those of wild-type DIO or H-CaMKII(KO) DIO mice in which hepato- 
cyte ATF4 expression was restored (Extended Data Fig. 3c). 

We infected DIO mice with an adeno-associated virus 8 (AAV8) that 
encoded a short hairpin RNA (shRNA) construct targeting mouse Dpp4 
driven by the H1 promoter. AAV8-H1-shDpp4 specifically silences 
DPP4 in hepatocytes but not in VAT or other tissues (Extended Data 
Fig. 3d, e); we refer to the vector as H-shDpp4 to reflect its hepatocyte 
specificity. H-shDpp4-treated DIO mice had significantly decreased 
plasma DPP4 activity (Fig. le), indicating that hepatocytes are a source 
of circulating DPP4 in obesity. H-shDpp4-treated DIO mice also had 
decreased numbers of CLS macrophages in the VAT and decreased 
levels of inflammatory cytokine mRNAs in VAT and VAT ATMs (Fig. 
1f and Extended Data Fig. 4a, b). However, these mice did not differ 
from control vector-treated DIO mice with regard to inflammation in 
inguinal or brown fat, plasma cytokines, body weight, food intake or 
liver or adipose tissue weight (Extended Data Fig. 4c—g). H-shDpp4- 
treated DIO mice also exhibited improved glucose homeostasis and 
increased insulin-induced AKT phosphorylation (p-AKT) in VAT and 
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technical replicates; mean + s.e.m.; *P < 0.05 by one-way ANOVA). d, 
The three groups of mice from Extended Data Fig. If, g were assayed for 
plasma DPP4 activity and hepatic Dpp4 mRNA (n =4 mice per group; 
mean +s.e.m.; *P < 0.05 by one-way ANOVA). e-g, Sixteen-week-old 
mice previously fed the DIO diet for 10 weeks were injected intravenously 
with AAV8-H1-shDpp4 (H-shDpp4) or control AAV8-H1 vector 
(control; or con, in f). The mice were analysed after four weeks as follows. 
e, Plasma DPP4 activity. f, CLS macrophages and Adgrel mRNA in VAT, 
with representative images of F4/80-stained VAT. g, Blood glucose after 
intraperitoneal glucose or insulin (n = 6 mice per group; mean + s.e.m.; 
*P < 0.05 by two-tailed Student’s t-test). 


liver (Fig. 1g and Extended Data Fig. 4h, i). Unlike DPP4 inhibition’, 
silencing of hepatocyte DPP4 did not increase levels of active plasma 
incretins (Extended Data Fig. 4j). Similar data were obtained when 
ob/ob mice were treated with the H-shDpp4 (Extended Data Fig. 5a—h). 
By contrast, treatment of lean mice with the H-shDpp4 had no effect 
on metabolism (Extended Data Fig. 5i-m), and DPP4 silencing did 
not improve insulin signalling in palmitate-treated primary hepato- 
cytes (Extended Data Fig. 6a). Moreover, consistent with the idea that 
adipocyte-derived non-esterified fatty acids (NEFA) may contribute 
to inflammation-induced insulin resistance!®, H-shDpp4 treatment 
lowered plasma NEFA in obese but not in lean mice (Extended Data 
Fig. 6b). These combined data suggest that the improvement in meta- 
bolism in DIO mice treated with H-shDpp4 is caused by suppression of 
VAT inflammation, which improves insulin signalling in hepatocytes 
and VAT as a secondary effect rather than via a direct effect on insulin 
signalling in hepatocytes. 

Oral DPP4 inhibitors do not lower plasma insulin in insulin-resistant, 
hyperinsulinaemic subjects'””. A recent study suggested that in obese 
mice, oral DPP4 inhibitors function by inhibiting gut endothelial 
DPP4”3, We tested the hypothesis that oral DPP4 inhibition and hepato- 
cyte DPP4 silencing have fundamentally different effects on adipose 
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Figure 2 | Silencing of hepatocyte DPP4, but not treatment with the 
oral DPP4 inhibitor sitagliptin, lowers VAT inflammation and 
improves metabolism in DIO mice. a-d, Control, H-shDpp4- and 
sitagliptin-treated mice from those described in Extended Data Fig. 6c—h 
were analysed as follows. a, Plasma active GIP (1-42) and GLP1 (7-36). 
NS, not significant. b, CLS macrophages and Adgrel mRNA in VAT, 

with representative images of F4/80-stained VAT. c, II6, Mcp1, Tnfa 

and I/1b mRNA in VAT. d, Blood glucose after oral glucose (O-GTT), 
intraperitoneal glucose (IP-GTT), or intraperitoneal insulin (IP-ITT). 
Data are mean +s.e.m., *P < 0.05 by one-way ANOVA. 


inflammation and glucose metabolism: we directly compared treatment 
with the oral DPP4 inhibitor sitagliptin with silencing of hepatocyte 
DPP4 in DIO mice. After four weeks, neither treatment affected body 
weight; both treatments lowered plasma DPP4 activity to a similar 
degree and neither treatment affected DPP4 activity in VAT (Extended 
Data Fig. 6c, d). However, H-shDpp4, but not sitagliptin, decreased 
hepatic DPP4 protein and hepatocyte DPP4 activity (Extended Data 
Fig. 6e). Conversely, sitagliptin, but not H-shDpp4, increased plasma 
incretins (Fig. 2a), consistent with our shDpp4 data and the known 
action of oral DPP4 inhibitors. Notably, H-shDpp4, but not sitagliptin, 
suppressed VAT inflammation (Fig. 2b, c), and although both treat- 
ments lowered blood glucose and improved oral glucose tolerance, 
only H-shDpp4 lowered plasma insulin, improved glucose response 
to intraperitoneal glucose and insulin, showed evidence of increased 
insulin-induced p-AKT in VAT and liver and lowered plasma NEFA 
(Fig. 2d and Extended Data Fig. 6f-h). Consistent with these results, 
both sitagliptin and H-shDpp4 treatments lowered plasma DPP4 activ- 
ity and blood glucose, but only H-shDpp4 decreased plasma insulin 
levels after 11 weeks, without a change in body weight (Extended Data 
Fig. 6i). Thus, DPP4 inhibition by orally administered sitagliptin and 
hepatocyte DPP4 silencing have fundamentally different effects on VAT 
inflammation and metabolism. 

Next, we investigated the mechanism of DPP4-induced VAT macro- 
phage inflammation. Although recombinant DPP4 (rDPP4) alone 
was unable to induce Mcp1 or II6 expression in SVE, it did so when 
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Figure 3 | DPP4 and FXa synergistically activate inflammatory 
signalling in macrophages. a, BMDMs were pretreated with or without 
10M FXa inhibitor rivaroxaban or 1011.M DPP4 inhibitor KR62436, 
followed by incubation for 4h with rFX or rDPP4 alone or in combination. 
Mcp1 mRNA was then quantified. n = 4 technical replicates per group; 
mean +s.e.m.; *P < 0.05 by one-way ANOVA. b, c, Sixteen-week-old 
mice previously fed the DIO diet for 10 weeks were treated for 20 days 
with 2 mg kg“! oral rivaroxaban twice daily (Riv) or vehicle control and 
analysed as follows. b, CLS macrophages and Adgrel mRNA in VAT, 
with representative images of F4/80-stained VAT. c, Blood glucose after 
intraperitoneal glucose or insulin (n= 5 mice per group; mean + s.e.m.; 
*P < 0.05 by two-tailed Student's t-test). d, e, BMDMs from wild-type, 
CAV1 knockout (CAV1 KO) or PAR2 knockout (PAR2 KO) mice were 
incubated for 4h with or without rFXa and rDPP4 and then assayed for 
Mcp1 and Il6 mRNA. n=4 technical replicates per group; mean + s.e.m.; 
*P < 0.05 by two-way ANOVA. 


added to either DIO mouse plasma that had been immunodepleted of 
DPP4 or plasma from DIO mice that had been treated with H-shDpp4 
(Extended Data Fig. 7a, b). To determine whether an additional plasma 
factor was needed for DPP4 to promote SVF inflammation, plasma 
from DIO mice was immunodepleted of DPP4 and then fractionated 
using FPLC. Two fractions (F43 and F44) were able to induce Mcp1 in 
SVE cells in the presence of rDPP4, but not in its absence (Extended 
Data Fig. 7c). Fraction F44 and the inactive fractions F42 and F46 were 
then analysed by LC-MS/MS to identify peptides that were more abun- 
dant in F44 versus F42 and F46 (Supplementary Table 2b). Peptides cor- 
responding to plasma factor X met these criteria (Extended Data Fig. 
8a). Factor X can trigger inflammation in endothelial cells and leuko- 
cytes”*?°, and we found that the ability of DIO plasma to induce Mcp1 
expression in SVF cells or macrophages was abrogated by rivaroxaban, 
an inhibitor of factor Xa (FXa) (Extended Data Fig. 8b). Furthermore, 
whereas recombinant factor X (rFX) or rDPP4 alone led to only very 
small increases in Mcp1 and II6 expression in macrophages, combined 
treatment with both rFX and rDPP4 was much more potent, and this 
inflammatory activity was inhibited by either rivaroxaban or a DPP4 
inhibitor (Fig. 3a and Extended Data Fig. 8c). These data also imply 
that factor X is converted to FXa by macrophages”®. Using recombinant 
FXa (rFXa) directly, we showed that rFXa and rDPP4 together were able 
to induce Mcp] and II6 expression in DIO mouse SVF cells but not in 
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lean mouse SVF cells (Extended Data Fig. 8d). Treatment of DIO mice 
with rivaroxaban lowered VAT inflammation and plasma NEFA, and 
improved metabolism, without affecting body weight (Fig. 3b, c and 
Extended Data Fig. 8e-}). 

To understand how the combination of DPP4 and FXa induces 
Mcp1 expression, we pre-treated macrophages with rFXa and then, 
after removing the rFXa and inhibiting residual activity with rivar- 
oxaban, treated the cells with rDPP4. We also conducted the reverse 
experiment by pre-treating the cells with rDPP4 and then, after remov- 
ing the rDPP4 and inhibiting residual activity with a DPP4 inhibitor, 
treated the cells with rFXa. Both protocols increased Mcp1 and II6 
mRNA expression (Extended Data Fig. 9a, b), suggesting that neither 
a complex between FXa and DPP4 nor a specifically ordered prim- 
ing event is needed. We therefore hypothesized that each factor alone 
partially activates inflammatory signalling to a sub-threshold level, 
and that the threshold is reached only when both factors are present. 
Inflammation induced by rDPP4 and rFXa was markedly reduced 
in macrophages from mice lacking either caveolin-1 (CAV 1), which 
mediates the inflammatory effects of DPP4 on antigen-presenting 
cells?’, or PAR2, a cell-surface receptor that mediates the inflam- 
matory effects of FXa in macrophages”® (Fig. 3d, e). Similar results 
were observed in VAT-derived ATMs from DIO mice, and human 
macrophages treated with inhibitors of CAV1 or PAR2 (Extended 
Data Fig. 9c, d). Consistent with the signalling-threshold hypothesis, 
rDPP4 alone increased phosphorylation of two reported CAV] sig- 
nalling intermediates, IRAK1 and TAK] (Extended Data Fig. 9e), and 
rFXa alone increased phosphorylation of the PAR2 signalling inter- 
mediate RAF1 (Extended Data Fig. 9f), but only combined treatment 
with rDPP4 and rFXa was able to activate two distal inflammatory 
signalling molecules, ERK1/2 and NF-«B (Extended Data Fig. 9g). 
Pharmacological inhibition of these signalling molecules abolished 
rDPP4-rFXa-induced Mcp1 and II6 expression (Extended Data Fig. 
9c, h-j). Together, these data suggest that DPP4 and FXa activate two 
separate upstream pathways that synergistically stimulate ERK1/2 and 
NF-kB to induce Mcp] and I/6 in ATMs. 

Next, we determined whether inhibition of the above pathways in 
ATMs could ameliorate VAT inflammation and improve metabolism 
in obese mice. Genes can be silenced specifically in ATMs in ob/ob 
mice by intraperitoneal administration of short interfering RNA 
(siRNA) encapsulated in micrometer-sized glucan shells, referred to 
as glucan-encapsulated siRNA particles (GERPs)*. Using this method, 
we determined the effect of decreasing Par2 and Cav1 expression in 
ATMs on VAT inflammation and glucose metabolism in ob/ob mice. 
Ina pilot experiment using FITC-labelled glucan shells in ob/ob mice, 
we verified their increased localization to macrophages in VAT (ATMs) 
relative to macrophages in other organs (Extended Data Fig. 10a). 
We then treated ob/ob mice with GERPs containing scrambled RNA 
(control), or siRNA targeting PAR2 or CAV1, over a 12-day period. 
Body weight was similar in all three cohorts (Extended Data Fig. 10b). 
PAR2 and CAV1 levels were decreased by the respective GERPs in 
ATMs but not in spleen (Extended Data Fig. 10c). Notably, both GERPs 
lowered VAT inflammation, improved glucose intolerance, increased 
insulin-induced p-AKT in VAT and liver and decreased plasma NEFA 
(Fig. 4a-d and Extended Data Fig. 10d, e). 

These findings emphasize the important role of crosstalk between 
hepatocytes and adipose tissue in metabolic disease (Fig. 4e) and the 
distinct effects of silencing hepatocyte DPP4 expression versus DPP4 
inhibition by oral inhibitors, which are widely used to treat type 2 dia- 
betes. Although sitagliptin has been reported to lower adipose inflam- 
mation in obese mice”, the drug was administered to young mice when 
they started receiving a high-fat diet, leading to an approximately 25% 
decrease in body weight in comparison to untreated mice. In our study, 
the drug was given to adult mice after obesity and VAT inflammation 
had developed, and it had no effect on body weight. We speculate that 
inhibition of circulating DPP4 activity does not block VAT inflam- 
mation in adult obese mice because some other action of oral DPP4 
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Figure 4 | Silencing PAR2 or CAV1 in ATMs in obese mice lowers VAT 
inflammation and improves response to insulin. a—d, Five-week-old, 
chow-fed ob/ob mice were injected intraperitoneally once a day for 12 
days with 0.2 mg GERPs containing scrambled siRNA (control), Par2 
siRNA (siPar2) or Cav1 siRNA (siCav1). Twenty-four hours after the last 
injection, the mice were analysed as follows. a, CLS macrophages and 
Adgrel mRNA in VAT, with representative images of F4/80-stained VAT. 
b, Blood glucose and plasma insulin 5h after food withdrawal. c, Blood 
glucose after intraperitoneal glucose or insulin. d, Total AKT and p-AKT 
in VAT and liver after portal vein insulin injection. n = 5-6 mice per 
group; *P < 0.05 by two-tailed Student’s t-test for each siRNA GERP versus 
control GERP. For gel source data, see Supplementary Fig. 1. e, Summary 
scheme: obesity triggers a Ca”t-CaMKII-ATF4 pathway in hepatocytes, 
leading to induction of DPP4 and secretion of soluble DPP4 (sol-DPP4). 
Soluble DPP4 activates a CAV1-IRAK1-TAK1 pathway in ATMs, which, 
in combination with PAR2-RAFI activation by FXa in ATMs, promotes 
ERK1/2-NF-kB-mediated inflammation. VAT inflammation exacerbates 
glucose intolerance, impaired insulin signalling in adipose and liver and 
hyperinsulinaemia. 


inhibition in that setting, perhaps related to the increase in plasma 
insulin*°, counteracts the anti-inflammatory effect of lowering plasma 
DPP4 activity. While future work is needed to understand the role of 
the hepatocyte DPP4-FXa pathway in obese, insulin-resistant humans, 
we have preliminary data that shows that a high percentage of obese 
subjects with fat inflammation and a high homeostasis model assess- 
ment (HOMA) index have inflammatory activity in their plasma that is 
dependent on DPP4, FXa and PAR2, whereas plasma from obese sub- 
jects without fat inflammation and insulin resistance do not show this 
dependency. In conclusion, the identification of this pathway provides 
insight into how organ crosstalk can exacerbate metabolic disarray in 
obesity and raises the possibility that therapeutic silencing of hepato- 
cyte DPP4— via hepatocyte-targeted siRNA®*’, for example—may have 
benefits for obesity-induced insulin resistance that are not achievable 
through currently available oral DPP4 inhibitors. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 

Reagents and antibodies. Recombinant mouse and human DPP4 (catalogue 
no. 954-SE and 9168-SE, respectively) and anti-DPP4 antibody (AF954) were 
from R&D Systems. The PAR2 inhibitor GB83 was from Axon Medchem (1622). 
Recombinant FX (233282) and FXa (233526), and the MEK inhibitor PD98059 
(513000) were from Millipore. The FXa inhibitor rivaroxaban ($3002) was from 
Selleckchem. IRAK1-4 inhibitor I (A3505) and sitagliptin phosphate monohydrate 
(A4036) were from ApexBio. Tri reagent (T9424), sodium palmitate (P9767), 
collagenase (C2139), liberase (5401020001), insulin (19278), the DPP4 inhibitor 
KR-62436 hydrate (K4264), the caveolin-1 inhibitor daidzein (D7802), PS-1145, 
IKK inhibitor (P6624), the TAK1 MAPKKK inhibitor 5Z-7-oxozeaenol (09890), 
anti-phospho-Thr209-IRAK1 (SAB4504246), and anti-3-actin antibody (A3854) 
were from Sigma. Anti-phospho-S473-AKT (4060), anti-AKT (4691), anti- 
phospho-Thr202/Tyr204-ERK1/2 (8544), anti-ERK (9194), anti-phospho-S536- 
NF-KB p65 (3033), anti- NF-kB p65 (8242), anti-phospho-S412-TAK1 (9339), 
anti-TAK1 (5206), anti-phospho-S338-cRAF1 (9427), anti-cRAF1 (53745) and 
anti-caveolin-1 (3267) antibodies were from Cell Signaling. Anti-PAR2 anti- 
body (817201) was from Biolegend. The DIO high-fat diet (60% kcal from fat) 
(D12492) was from Research Diets. Adenoviral Atf4 (adeno-Atf4) was a gift from 
R. J. Kaufman (Sanford—Burnham Medical Research Institute), adenoviral Dpp4 
(adeno-Dpp4) was purchased from Vector Biolabs, and adeno-associated virus 
8 (AAV8) containing either hepatocyte-specific TBG-Cre recombinase (AAV8- 
TBG-cre) or the control vector (AAV8-TBG-lacZ) were purchased from the 
Penn Vector Core. All adenoviruses were amplified by Viraquest. The AAV8-H1- 
shRNA construct targeting murine DPP4 was made by annealing complementary 
oligonucleotides and then ligating them into the pAAV-RSV-GFPH1 vector, as 
described previously**. The resultant constructs were amplified by Salk Institute 
Gene Transfer, Targeting, and Therapeutics Core. siRNA sequences against 
murine Par2 were purchased from GE Dharmacon and siRNA sequences against 
mouse Cav] were purchased from Integrated DNA Technologies. 

Mouse experiments. Camk2¢" mice were generated as previously described!” and 
crossed onto the C57BL6/J background. Atf4" mice were generated as previously 
described**, Male Camk2e™" or Atf4™" mice were fed a high-fat, high-calorie diet 
(HED, 60% kcal from fat) for 13 weeks starting at three weeks of age and were 
maintained on a 12-h light-dark cycle. Fifteen-week-old high-fat-diet-fed obese 
mice (DIO) that had been fed with HFD for nine weeks, four-week old chow-fed 
ob/ob mice, or sixteen-week-old chow-fed wild-type C57BL/6J mice were pur- 
chased from Jackson Laboratories and maintained for one week in the animal 
facility before commencement of experiments. Sixteen-week-old DIO Camk2g!"!! 
mice that had been fed with HFD for 13 weeks were treated with adeno-associated 
viruses (AAV) for expression of either hepatocyte-specific TBG-cre*4 or TBG-lacZ 
(the control vector) for three weeks to obtain H-CaMKII(KO) and wild-type mice. 
To obtain ATF4-restored H-CaMKII(KO) or DPP4-restored H-CaMKII(KO) mice, 
TBG-cre-treated Camk2g! mice received either adeno-Atf4 (H-CaMKII(KO) 
plus adeno-Atf4) or adeno-DPP4 (H-CaMKII(KO) plus adeno-Dpp4), while the 
remaining mice received adeno-lacZ. Similarly, sixteen-week-old Atf4“" DIO mice 
that had been fed with HFD for 13 weeks were treated with AAV8-TBG-cre or 
AAV8-TBG-lacZ to obtain hepatocyte-ATF4 KO (H-ATF4(KO)) and wild-type 
mice. Sixteen-week-old wild-type DIO mice, four-week old ob/ob and sixteen- 
week-old wild-type C57BL/6] mice were treated with AAV8-H1-shDpp4 or 
AAV8-H1-control to obtain hepatocyte-specific DPP4 silencing (H-shDpp4) 
or control mice, respectively. In all mouse experiments, recombinant adenovi- 
rus (1 x 10° plaque-forming units per mouse) or adeno-associated virus (1 x 101! 
genome copies per mouse) was delivered by tail vein injection, and experiments 
were commenced after 7-28 days or, for the experiment in Extended Data Fig. 
6i, after 11 weeks. DIO mice were treated with oral rivaroxaban (2 mg/kg) twice 
daily for 20 days. DIO mice were treated for four or seven weeks with sitagliptin by 
adding it to the drinking water at 0.3 mg/ml, which results in a dose of ~30-45 mg/ 
kg/day. Mouse plasma samples were collected from lean or obese mice after 4-5h 
of food withdrawal, with free access to water. Blood glucose was measured after 
4—5h of food withdrawal in mice using a glucose meter (One Touch Ultra, Life- 
scan). Plasma insulin levels were measured using the Ultrasensitive Mouse Insulin 
ELISA Kit (Crystal Chem, 90080) and plasma MCP1, IL6 and TNFa levels were 
measured using mouse ELISA kits from RayBiotech (ELM-MCP1-1; ELM-IL6-1; 
ELM-TNFa-1). For gastric inhibitory polypeptide (GIP) and glucagon-like pep- 
tide-1 (GLP-1) measurements, mice were fasted overnight and then fed for 4-5h 
before collecting plasma samples into a tube containing DPP4 inhibitor (5|1M). 
Plasma active GIP was measured using Sandwich mouse active GIP (1-42) ELISA 
Kit (Crystal Chem, 81511) and GLP-1 was assayed in plasma samples using mouse 
active GLP-1 (7-36) ELISA kit (Eagle Biosciences, GP121-K01). Plasma non-esteri- 
fied fatty acids (NEFA) were assayed using an enzymatic kit from Wako Diagnostics 
(HR Series NEFA-HR(2)-Colour Reagent A, 999-34691; Solvent A, 995-34791; 


Colour Reagent B, 991-34891; Solvent B, 993-35191; and NEFA standard solution, 
276-76491). Intraperitoneal-glucose tolerance tests were performed by intraperi- 
toneal injection of glucose (1 g/kg body weight for DIO, 0.5 g/kg body weight for 
ob/ob and 2 g/kg body weight for lean mice) following an overnight fast. Oral 
glucose tolerance tests were conducted by delivering a 2-g/kg glucose bolus orally 
via gavage to DIO mice after a 5-h fast. Insulin tolerance tests were performed in 
mice that were fasted for 4—5 h, by intraperitoneal insulin injection (0.6 IU/kg body 
weight for DIO, 2 IU/kg body weight for ob/ob and 0.5 IU/kg body weight for lean 
mice) and by assaying blood glucose at various time points. Previous studies and 
pilot experiments formed the basis of power calculations for the various studies. 
Depending on the experiment, calculations indicated that 3-12 mice per group 
would enable the testing of our hypotheses based on an expected 25-30% coef- 
ficient of variation and an 80% chance of detecting a 33% difference in the key 
specified endpoints (P < 0.05). For all experiments, male mice of the same age 
and similar weight were randomly assigned to experimental and control groups. 
On occasion, we analysed a subset of mice for a particular parameter, and the 
subset was chosen randomly from a full cohort. Pre-specified exclusion criteria 
were weight loss 10% of initial body weight or signs of illness or injury requiring 
euthanasia. According to these pre-specified criteria, the maximum number of 
mice removed before analysis was three, but was more typically between zero and 
two. Animal studies were conducted in accordance with the Columbia University 
Animal Research Committee. Moreover, because switching mice from group cag- 
ing to single caging, improper handling and placing mice in a restrainer affect 
stress-sensitive metabolic parameters including plasma DPP4 levels and adipose 
tissue inflammation (D.S.G., L.O. and LT., unpublished observations), all experi- 
mental mice were housed in groups to avoid stress from isolation and handled with 
extra care to avoid stress during blood glucose measurement and blood collection. 
F4/80 immunostaining of VAT. VAT samples were fixed in 10% (v/v) formalin 
solution for 24h, followed by incubation with 70% ethanol (v/v) for 24h and then 
embedding in paraffin. Six-micrometre sections were mounted on charged glass 
slides, incubated at 60°C for 20 min, and deparaffinized using three washes with 
xylene. Antigen retrieval was carried out by treating the sections with proteinase 
K (1:1,000) (Invitrogen, AM2546) at 37°C for 25 min. The sections were then 
rinsed and blocked using 5% serum, followed by incubation overnight at 4°C 
with anti-F4/80 biotin-conjugated primary antibody (1:100) (Miltenyi Biotec, 130- 
101-893). The sections were then incubated with streptavidin-HRP secondary 
antibody (1:200) (BD Pharmingen, 554066) and then developed in chromagen 
substrate 3,3’-diaminobenzidine (Cell Signaling, 8059) and counterstained with 
haematoxylin. For each mouse, 5-10 sections were analysed by microscopic imag- 
ing at 20x magnification. The total number of nuclei and the number of nuclei 
of F4/80-immunostained cells surrounding adipocytes in CLS were counted for 
each field. The data are expressed as mean percentage of CLS macrophages per 
total adipose tissue nuclei. 

Portal vein insulin infusion and protein extraction from tissues. After 4-5h 
of food withdrawal, mice were anaesthetized and insulin (0.6 IU/kg body weight 
for DIO, 2 IU/kg body weight for ob/ob and 0.5 IU/kg body weight for lean mice) 
or PBS was injected through the portal vein. Three minutes after injection, liver 
and adipose tissue were removed, frozen in liquid nitrogen, and kept at —80°C 
until processing. For protein extraction, tissue samples were cut into small pieces 
and transferred to tubes containing ice-cold RIPA buffer supplemented with Halt 
Protease and Phosphatase Inhibitor Cocktail (Thermo Scientific, 78440). Tissue 
segments were then homogenized on ice, lysates were centrifuged, and the super- 
natant fractions were used for immunoblot analysis. 

Primary hepatocyte experiments. Wild-type C57BL/6] mice (8-10 weeks old) 
were injected with either control vector or AAV8-shDpp4 (H-shDpp4) via tail 
vein injection. Ten days after injection, primary mouse hepatocytes were isolated 
as previously described*®. Hepatocytes were incubated with either BSA control or 
palmitate (50|1M) for 10h, with the last 5h in serum-free medium. The cells were 
then treated with 100 nM insulin or vehicle control for 5 min and immediately 
frozen in liquid nitrogen and kept at —80°C until further processing. Cells were 
lysed using 2x Laemmli buffer supplemented with Halt Protease and Phosphatase 
Inhibitor Cocktail and lysates were used for immunoblotting. 

Preparation of stromal vascular fraction cells and isolation of adipose tis- 
sue macrophages. SVF from lean or obese mice was prepared as previously 
described*®. In brief, VAT was isolated in ice-cold PBS containing liberase (50 1g/ 
ml) and minced into small segments. VAT was digested at 37 °C for 1h with inter- 
mittent mixing. After digestion, the solution was centrifuged, buoyant adipocytes 
were removed, and the cell pellet was retrieved as SVE. SVF cells were then cul- 
tured in medium containing 10% (vol/vol) FBS or plasma from lean or obese mice. 
For isolation of ATMs, anti-F4/80 biotin-conjugated primary antibody (Miltenyi 
Biotec, 130-101-893) was added at a 1:10 dilution to 4 x 10° SVE cells in 200 il of 
PBS containing 5% of FBS (PBS-FBS), followed by incubation at 4°C for 20 min. 
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The SVF cells were pelleted by centrifugation, washed once with 1 ml of buffer, and 
resuspended in 200,11 of PBS-FBS. The cells were then incubated with streptavi- 
din-conjugated magnetic microbeads (1:10) (Miltenyi Biotec, 130-048-101) at 4°C 
for 20 min. The cells were rinsed once with PBS-FBS, and ATMs were isolated 
using magnetic separation columns (Miltenyi Biotec, 130-042-201) and cultured 
in medium containing 10% (vol/vol) FBS. 

Immunoblotting. Proteins from tissue or cell lysates were resolved on 4-20% 
Tris—glycine gradient gels and then transferred to nitrocellulose membranes. The 
membranes were blocked using either 5% non-fat milk or 5% BSA followed by 
incubation with primary antibodies overnight. Blots were then washed thoroughly 
and probed with HRP-conjugated secondary antibodies for 1h. Protein bands 
were detected with Supersignal West Pico enhanced chemiluminescent solution 
(Thermo Scientific, Cat. No. 34080) and analysed by ImageJ. 

Quantitative reverse transcription-PCR. Total RNA was extracted from tissues 
or primary cell cultures using Tri-reagent (Sigma, T9424). cDNA was synthesized 
from 2 1g total RNA using oligo (dT) and Superscript II (Invitrogen, AM5730G 
and 18064014). Real time quantitative (q)PCR was performed with a 7500 Real- 
Time PCR System (Applied Biosystems) using SYBR Green master mix (Qiagen, 
204145) and gene-specific primers. The sequences of gene-specific primers used 
for the qPCR assays are described in Supplementary Table 1. 

DPP-4 immunodepletion from plasma. DPP-4 was immunodepleted from obese 
mouse plasma using bead-immobilized anti- DPP-4 antibody (R&D Systems, 
AF954) as previously described*”. In brief, 101g of anti-DPP-4 antibody was con- 
jugated with 1.5 mg of Protein-G Dynabeads (Invitrogen, 10004D) in 200 1l of 
binding buffer. After immobilization of anti-DPP4 antibody to beads, 4011 of DIO 
mouse plasma and 60 11 of PBS were mixed with 40 1l of antibody-coated bead sus- 
pension. The resultant mixture was incubated for 2.5h at 4°C on a rotating wheel. 
After 2.5h, Dynabeads were separated with a magnet and discarded. This was 
followed by one more round of immunocapture of DPP4 and discarding of beads. 
DPP-4 proteolytic activity assay was performed on immunodepleted plasma to 
confirm DPP-4 immunodepletion. For some experiments, DIO mouse plasma was 
incubated with anti-DPP4-conjugated or control IgG-conjugated beads, followed 
by elution of the bound material using 50 mM glycine buffer, pH 2.8. 

Separation of plasma proteins using FPLC. Lean or obese mouse plasma (0.2 ml) 
was diluted 1:1 with PBS and injected onto a Superdex 200 Increase 10/300GL 
FPLC column (GE Healthcare, 28990944). Plasma proteins were eluted from the 
column at a constant flow of 0.3 ml/min with PBS (pH 7.4). The effluent was col- 
lected in 0.25-ml fractions. The collected fractions were frozen in dry ice and kept 
at -80°C until further processing. 

In-gel protein digestion. Inflammatory obese mouse plasma, DPP4-depleted 
obese mouse plasma fractions or DPP4 immunoprecipitated from obese mouse 
plasma (IP-DPP4) were resolved on a 4-20% Tris—glycine gradient gel and then 
stained with GelCode Blue Safe Coomassie stain (Thermo Scientific, 24594) for 
2h followed by de-staining with double distilled water overnight. The resolved 
proteins were excised and processed at Herbert Irving Comprehensive Cancer 
Center Proteomics Shared Resource, Columbia University. The excised gel pieces 
were rehydrated and digested in 80,11 of 12.5 ng/l Trypsin Gold in 50 mM ammo- 
nium bicarbonate at 37°C overnight. After digestion was complete, condensed 
evaporated water was collected from the tube walls by brief centrifugation using 
a benchtop microcentrifuge (Eppendorf). The gel pieces and digestion reaction 
were mixed with 501L 2.5% trifluoroacetic acid (TFA) and rigorously mixed for 
15min. The solution with extracted peptides was transferred into a fresh tube. The 
remaining peptides were extracted with 8011 70% acetonitrile-5% TFA mixture 
using rigorous mixing for 15 min. The extracts were pooled and dried to comple- 
tion (1.5-2h) in a SpeedVac. The dried peptides were reconstituted in 3011 0.1% 
TFA by mixing for 5 min and stored in ice or at —20°C before analysis. 
LC-MS/MS analysis. The concentrated peptide mix was reconstituted in a solution 
containing 2% acetonitrile and 2% formic acid for mass spectrometry analysis. 
Peptides were eluted from the column using a Dionex Ultimate 3000 Nano liquid 
chromatography system with a 10-min gradient from 2% buffer B to 35% buffer B 
(100% acetonitrile, 0.1% formic acid). The gradient was switched from 35% to 85% 
buffer B over 1 min and held constant for 2 min. Finally, the gradient was changed 
from 85% buffer B to 98% buffer A (100% water, 0.1% formic acid) over 1 min, 
and then held constant at 98% buffer A for a further 5 min. The application of a 
2.0-kV distal voltage electrosprayed the eluting peptides directly into the Thermo 
Fusion Tribrid mass spectrometer equipped with an EASY-Spray source (Thermo 
Scientific). Mass spectrometer-scanning functions and HPLC gradients were con- 
trolled by the Xcalibur data system (Thermo Finnigan). 

Database search and interpretation of MS/MS data. Tandem mass spectra from 
raw files were searched against a mouse protein database using the Proteome 
Discoverer 1.4 software (Thermo Finnigan). The Proteome Discoverer applica- 
tion extracts relevant MS/MS spectra from the .raw file and determines the pre- 
cursor charge state and the quality of the fragmentation spectrum. The Proteome 
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Discoverer probability-based scoring system rates the relevance of the best matches 
found by the SEQUEST algorithm. The mouse protein database was downloaded as 
FASTA-formatted sequences from the Uniprot protein database (database released 
in September, 2014). The peptide mass search tolerance was set to 10 p.p.m. 
A minimum sequence length of seven amino acids residues was required. Only 
fully tryptic peptides were considered. To calculate confidence levels and false 
discovery rates (FDR), Proteome Discoverer generates a decoy database contain- 
ing reverse sequences of the non-decoy protein database and performs the search 
against this concatenated database (non-decoy + decoy). Scaffold (Proteome 
Software) was used to visualize searched results. The discriminant score was set 
at less than 1% FDR, determined on the basis of the number of accepted decoy 
database peptides, to generate protein lists for this study. Spectral counts were used 
for estimation of relative protein abundance between samples. 

Mouse liver nuclei preparation and chromatin immunoprecipitation assays. 
Mouse liver tissues were homogenized using a Dounce homogenizer (Wheaton) 
with the loose pestle in 1:10 (w:v) of ice-cold NP-40 lysis buffer supplemented with 
protease inhibitor cocktail. Liberation of nuclei was monitored by DAPI staining 
and fluorescence microscopy. To purify intact nuclei, lysates were then layered 
over a sucrose gradient consisting of 1 M sucrose overlaid with 0.68M sucrose 
and then centrifuged at 4000 r.p.m. for 20 min at 4°C. Following a washing step, 
nuclear pellets were cross-linked with 1% fresh formaldehyde in PBS for 10 min 
at room temperature. Cross-linking was terminated by addition of 200 mM Tris- 
HCl (pH 9.4) and 1mM DTT for 10 min followed by centrifugation at 2,500 r.p.m. 
for 10 min at 4°C. Nuclear pellets were suspended in SDS lysis buffer containing 
protease inhibitors, incubated for 10 min on ice, and DNA was sheared in a cold 
water bath in a focused-ultrasonicator (Covaris, $2) to obtain DNA fragments with 
an average size of 500 bp. Fragmented chromatin was pre-cleaned by incubating 
with normal rabbit IgG (Santa Cruz, sc-2025) for 1h at 4°C, followed by 1h of 
incubation with 50 11 protein G magnetic beads (Pierce) at 4°C with rotation. 
A rabbit anti- ATF4 antibody (Cell Signaling, 11815) to pull down ATF4-binding 
complexes, and a control rabbit anti-haemagglutinin antibody (Santa Cruz) were 
used to pull down non-specific binding complexes. Immunoprecipitated chromatin 
fragments were reverse cross-linked, digested by proteinase K, and purified using 
QlAquick PCR Purification Kit (Qiagen, 28104). The presence of ATF4 in the Dpp4 
exonic region was quantified by qPCR and expressed relative to the input genomic 
DNA. The sequences of primers used for the ChIP-qPCR assays are described in 
Supplementary Table 2. 

Macrophage assays. ATMs, peritoneal macrophages or BMDMs were incubated 
for 4h in medium containing 10% plasma from lean or obese mice and then 
assayed for Mcp1 or I/6 mRNA using quantitative reverse transcription-PCR 
(RT-qPCR). ATMs, bone marrow-derived macrophages or human macrophages 
were incubated for 4h with 0.25 U/ml rFXa, 175 ng/ml rDPP4, or both proteins 
together. The cells were then assayed for Mcp1 or II6 mRNA using RT-qPCR. 
Glucan-encapsulated siRNA particle (GERP) study in ob/ob mice. FITC-labelled 
glucan shells (GS) containing non-targeting (control) siRNA or siRNAs against 
Par2 or Cav1 were prepared and loaded as previously described**””. In brief, 2.4mg 
of FITC-labelled glucan shells-lectin, 7.2 nmol of siRNA, and 120 nmol of endo- 
porter were combined to form FITC-GERPs. Five-week old ob/ob mice received 12 
daily doses of FITC-GERPs by intraperitoneal injection, which contained 0.2 mg 
FITC-GS, 0.6 nmol siRNA and 10 nmol endoporter. Glucose tolerance and insulin 
tolerance tests were performed on days seven and ten of GERP treatment, respec- 
tively. Mice were euthanized 24h after the last injection. 

Statistical analysis. All results are presented as mean + s.e.m. For experiments 
with two groups, P values were calculated using the two-tailed Student's t-test 
for normally distributed data and the Mann-Whitney rank sum test for non- 
normally distributed data. For experiments with more than two groups, P values 
were calculated using one-way or two-way ANOVA for normally distributed data 
and the Kruskal-Wallis by ranks test for non-normally distributed data, with 
Student-Newman-Keuls or Dunn’s post hoc test. Error bars that appear to be 
absent from certain graph symbols signify s.e.m. values smaller than the graph 
symbols. 

Data availability. The data that support the findings of this study are available 
from the corresponding author upon reasonable request. 
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Extended Data Figure 1 | Hepatocyte-specific deletion of CaMKII or 
ATF4 in DIO mice lowers VAT inflammation. a—c, Sixteen-week-old 
Camk2g!""' mice that had been previously fed the DIO diet for 13 weeks 
were injected intravenously with AAV8-TBG-cre (H-CaMKII(KO)) 

or AAV8-TBG-lacZ (wild-type, WT). Mice were analysed after three 
additional weeks on the DIO diet. a, Representative images of VAT 
immunostained for F4/80, with quantification of crown-like-structure 
(CLS) macrophages, and expression of Adgre] mRNA, which encodes 
F4/80. b, As in a, except that the mice were injected with fluorescent 
beads using a procedure that labels circulating Ly6c! monocytes, and 
then bead-labelled cells were assayed in VAT sections. c, mRNAs for 1/6, 
Mcp1, Tnf and Il1b in VAT. d, Blood monocyte count. e, Plasma IL6 and 
TNFa measured by ELISA, and quantification of Adgrel, Mcp1 and Tnfa 
mRNA in liver. In a-e, n= 4 mice per group; mean + s.e.m.; *P < 0.05; 
n.s., not significant by two-tailed Student’s t-test. f, g, Mice similar to those 
in a-c, and from a third group in which Camk2g¢" mice were injected 


intravenously with adeno-Atf4 and AAV8-TBG-cre (H-CaMKII(KO) 

+ adeno-Atf4). f, CLS macrophages and Adgrel mRNA in VAT were 
quantified, with representative images of F4/80-stained VAT. g, 1/6, Mcp1, 
Tnfa and Il11b mRNA in VAT and Adgre1, Mcp1 and Tnfa mRNA in liver 
were quantified. Note that the first two groups of mice received adeno-lacZ 
instead of adeno-Atf4. n= 4 mice per group; mean +s.e.m.; *P < 0.05 by 
one-way ANOVA; n.s., not significant. h, AAV8-TBG-cre (H-ATF4(KO)) 
or AAV8-TBG-lacZ (wild-type) was injected intravenously into 16-week- 
old Aft4" mice previously fed the DIO diet for 13 weeks. After three 
further weeks on the DIO diet, VAT from these mice was immunostained 
for F4/80 to identify macrophages, the percentage of macrophages in CLS 
was quantified, and the VAT was assayed for Adgre1 and the indicated 
inflammatory mRNAs. Twelve wild-type and 11 H-ATF4(KO) mice was 
analysed for CLS macrophages, and a randomly selected subset of five 
wild-type and five H-ATF4(KO) mice was analysed for the VAT mRNAs. 
Mean +s.e.m., *P < 0.05 by two-tailed Student’s t-test. 
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Extended Data Figure 2 | DPP4 in the plasma of DIO mice induces 
Mcp1 and II6 in SVF cells from the VAT of obese mice and in 
macrophages. a, Representative images of haemotoxylin and eosin-stained 
VAT sections from lean and DIO mice and Adgre1 mRNA levels in SVF 
from lean and DIO mice. n= 4 mice per group; mean + s.e.m.; *P < 0.05. 
b, SVE cells from the VAT of lean mice were incubated for 4h in medium 
containing 10% (v/v) plasma from lean or DIO mice and then assayed 
for Mcp1 and Il6 mRNA. n=4 mice per group; mean + s.e.m.; *P < 0.05; 
n.s., not significant. c, Mcp1 mRNA was assayed in mouse peritoneal 
macrophages (Ms) or bone marrow-derived macrophages that were 
incubated for 4h with medium containing 10% (v/v) plasma from 

lean or DIO mice. n=5 technical replicates per group; mean + s.e.m.; 

*P <0.05. d, Mcp1 mRNA levels in SVF cells that were incubated for 

4h with medium containing 10% (v/v) control or heat-inactivated 

(heat) plasma from the indicated groups of mice. n = 6 mice per group; 
mean + s.e.m.; *P < 0.05. e, UV protein chromatogram obtained after 


fractionation of DIO mice plasma using gel-filtration FPLC; vertical grey 
bar depicts peak of activity shown in f. f, Obese mouse SVF cells were 
incubated with medium containing 10% lean or DIO mouse plasma or 

the indicated FPLC fractions from e and assayed for Mcp1 mRNA. n.d., 
Mcp1 mRNA not detected. Arrows indicate the fractions that were selected 
for LC-MS/MS analysis. g, LC-MS/MS normalized spectral counts 
corresponding to DPP4 in the FPLC fractions from f. h, DPP4 activity in 
the plasma of lean and DIO mice. n=5 mice per group; mean + s.e.m.; 

* P< 0.05. i, SVE cells from DIO mice were incubated for 4h with medium 
containing 10% (v/v) lean or DIO mouse plasma that was pre-treated for 
1h with or without 104M DPP4 inhibitor KR62436. The cells were then 
assayed for IJ6 mRNA. n= 3 technical replicates per group; mean + s.e.m.; 
*P <0.05. j, VAT from the mice in Extended Data Fig. 1f, g was assayed 
for Dpp4 mRNA (n= 4). Data in a-c and h were analysed by two-tailed 
Student's t-test; data in i and j were analysed by one-way ANOVA (g, h); 
data in d were analysed by two-way ANOVA. 
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Extended Data Figure 3 | Restoration of DPP4 in livers of 
H-CaMKII(KO) mice abrogates suppression of VAT inflammation 
in DIO mice; ATF4 ChIP of the Dpp4 gene; and AAV8-H1-shDpp4 
treatment lowers hepatic DPP4. Wild-type and H-CaMKII(KO) mice 
and a third group in which Camk2g"" mice were injected intravenously 
with adeno-Dpp4 together with the AAV8-TBG-cre (H-CaMKII(KO) 

+ adeno-Dpp4) were analysed as follows. a, Body weight, plasma DPP4 
activity and liver and VAT DPP4 protein. b, Representative images of 
VAT immunostained with F4/80 antibody, with quantification of CLS 
macrophages and Tufa and Mcp1 mRNA in VAT. Note that the first two 
groups of mice received adeno-lacZ instead of adeno-DPP4. Ina andb, 
n= 3-4 mice per group; mean + s.e.m.; *P < 0.05 by one-way ANOVA; 
n.s., non-significant. For gel source data, see Supplementary Fig. 1. c, Top, 
ChIP was performed with liver extracts from the indicated groups of 
mice using anti-ATF4 or control IgG antibodies. The region spanning 
the predicted ATF4-binding sequence in exon 1 of Dpp4 was amplified 
by RT-qPCR and normalized to the values obtained from the input. 
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n=3 ChIP assays for wild-type, H-CaMKII(KO), H-CaMKII(KO) + 
adeno-Atf4; n= 1 for H-ATF4(KO); n= 10 for control IgG; mean + s.e.m.; 
#*P <0.05 by one-way ANOVA for groups 1-3. Bottom, per cent input 
for ATF4 ChIP using liver extracts from the indicated DIO mice and 
PCR primers for a region in the Dpp4 gene that does not contain a 
consensus sequence for ATF4 binding. n= 3 ChIP assays for wild type, 
H-CaMKII(KO), H-CaMKII(KO) + adeno-Atf4; n= 1 for H-ATF4(KO); 
n= 10 for control IgG; mean + s.e.m.; n.s., non-significant by one-way 
ANOVA. d, e, Sixteen-week-old mice previously fed with the DIO diet 
for 10 weeks were injected intravenously with hepatocyte-specific AAV8- 
H1-shDpp4 (H-shDpp4) or control AAV8-H1 vector (con). After four 
additional weeks on DIO diet, the mice were analysed as follows. d, DPP4 
immunoblot, with densitometric quantification of DPP4 protein in the 
indicated tissues; representative of three independent experiments. 

e, Dpp4 mRNA in liver and VAT. In d and e, n=5 mice per group; 

mean + s.e.m.; *P < 0.05; n.s., non-significant by two-tailed Student’s 
t-test. See Supplementary Fig. 1 for gel source data. 
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Extended Data Figure 4 | Silencing of DPP4 in liver suppresses VAT inflammatory gene mRNA expression in perirenal, inguinal and brown 
inflammation and improves metabolism without increasing plasma fat. e, Plasma MCP1, IL6 and TNFa. f, Body weight and food intake. 
incretins. a, Top, 16-week-old mice previously fed with the DIO diet g, Weights of liver and the indicated adipose tissue depots. In a-g, n = 5-6 
for 10 weeks were injected intravenously with AAV8-H1-shDpp4 mice per group; mean + s.e.m.; *P < 0.05; n.s., non-significant by two- 
(H-shDpp4) or control AAV8-H1 vector (con). After four weeks, the tailed Student's t-test. h-j, Mice similar to those in b-g were assayed. 
mice were assayed for I/6, Mcp1, Tnfa and Il11b mRNA in VAT. Bottom, h, Blood glucose and plasma insulin. i, p-AKT and total AKT in VAT 
control and H-shDpp4-treated mice similar to those above were analysed and liver after insulin injection into the portal vein. n = 1 PBS-injected, 
nine days after adenovirus injections for I16, Mcp1, Tnfa and Il1lb mRNA n=4 insulin-injected mice per group; blots are representative of three 
in ATMs. b-g, H-shDPP4 and control mice were analysed after four independent experiments. Gel source data are shown in Supplementary 
weeks. b-d, Representative images of adipose tissue immunostained with Fig. 1. j, Plasma active GIP (1-42) and GLP1 (7-36). In h andj, n=6 mice 
F4/80 antibody, with quantification of CLS macrophages, Adgre1 and per group; mean +s.e.m.; *P < 0.05; n.s., non-significant by two-tailed 
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Extended Data Figure 5 | Hepatocyte-specific silencing of DPP4 improves 
glucose metabolism in ob/ob mice without increasing plasma incretins 
and does not affect VAT inflammation or glucose metabolism in lean 
mice. a-h, Five-week-old chow-fed ob/ob mice were injected intravenously 
with AAV8-H1-shDpp4 (H-shDpp4) or AAV8-H1-control (con), and were 
assayed four weeks later. a, Body weight. b, Immunoblot of liver and VAT 
DPP4. c, Representative images of VAT immunostained with F4/80 antibody, 
with quantification of CLS macrophages and Adgrel mRNA in VAT. d, II6, 
Mcp1, Tnfa and Il1b mRNA in VAT. e, Blood glucose after challenge with 
intraperitoneal glucose or insulin. f, p- AKT and total AKT in VAT and liver 
extracts after portal vein insulin injection. g, Blood glucose and plasma 


insulin 5h after food withdrawal. h, Plasma active GIP (1-42) and GLP1 
(7-36). i-m, Sixteen-week-old chow-fed wild-type lean mice were injected 
intravenously with AAV8-H1-shDpp4 (H-shDpp4) or AAV8-H1-control 
(Con) and were analysed as follows. i, Body weight. j, Immunoblot of DPP4 
in liver and VAT. k, p-AKT and total AKT in VAT and liver extracts after 
portal vein insulin injection. 1, Blood glucose and plasma insulin 5 h after 
food withdrawal. m, Blood glucose after challenge with intraperitoneal 
glucose or insulin. In all panels, n = 5-6 mice per group; mean +s.e.m.; 
n.s., non-significant by two-tailed Student's t-test. For gel source data, 

see Supplementary Fig. 1. 
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Extended Data Figure 6 | Effect of DPP4 silencing on insulin-induced 
p-AKT in primary hepatocytes, on NEFA in obese or lean mice and 

on metabolism in comparison with sitagliptin in obese mice. a, Wild- 
type or DPP4-silenced primary hepatocytes were treated with or without 
50M palmitate for 10 h with the last 5h in serum-free medium, and then 
stimulated with 100 nM of insulin for 5 min. p-AKT and total AKT and 
DPP4 were assayed by immunoblot. The data are representative of two 
independent experiments. b, Plasma samples from the following mice were 
assayed for non-esterified fatty acids (NEFA) four weeks after intravenous 
injection with AAV8-H1-shDpp4 (H-shDpp4) or control AAV8-H1 vector 
(con): 16-week-old mice fed the DIO diet for the last 10 weeks, 5-week- 
old chow-fed ob/ob mice, and 6-week-old chow-fed wild-type lean mice. 
n=5-6 mice per group; mean + s.e.m.; *P < 0.05 and n.s., non-significant 
by two-tailed Student's t-test. c-h, After 10 weeks on high-fat diet, 
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control DIO mice, H-shDpp4 DIO mice and DIO mice were administered 
sitagliptin (sita) in drinking water to achieve a dose of ~30-45 mg/kg/ 
day. After 4 weeks of treatment, the mice were analysed as follows. c, Body 
weight. d, Plasma and VAT DPP4 activity. e, Hepatic DPP4 immunoblot 
and hepatocyte DPP4 activity. f, Blood glucose and plasma insulin 5 h after 
food withdrawal. g, p-AKT and total AKT in VAT and liver after portal 
vein insulin injection. h, Plasma non-esterified fatty acids (NEFA). For all 
quantified data panels except e, n = 6-8 mice per group; for e, n =4 mice 
per group; mean + s.e.m.; **#P < 0.05 and n.s., non-significant by one-way 
ANOVA. For gel source data, see Supplementary Fig. 1. i, DIO mice were 
treated with AAV8-H1-con, AAV8-H1-shDpp4 or sitagliptin (+ AAV8- 
H1-con) exactly as above, except the mice were analysed after 11 weeks of 
treatment instead of after 4 weeks. n =5 mice per group; mean + s.e.m.; ** 
P<0.05 and n.s., non-significant by one-way ANOVA. 
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were incubated for 4h with medium containing 10% (v/v) DIO mouse ANOVA). c, DPP4-depleted plasma from DIO mice was fractionated by 
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Extended Data Figure 8 | DPP4 and FXa synergistically promote 

VAT inflammation, and inhibition of FXa improves glucose 
metabolism. a, LC-MS/MS normalized spectral counts corresponding 
to FX in the indicated FPLC fractions from Extended Fig. 7c. b, SVF 

cells or BMDMs were pre-incubated for 1 h with or without 101M FXa 
inhibitor rivaroxaban (riv) and then incubated with medium containing 
10% (v/v) plasma from lean or DIO mice and assayed for Mcp1 mRNA. 
n=3 technical replicates per group; mean + s.e.m.; **P < 0.05 by two- 
way ANOVA. c, BMDMs were pretreated with or without 10 1M FXa 
inhibitor rivaroxaban (riv) or 10 1M DPP4 inhibitor KR62436, followed by 
incubation for 4 h with rFX or rDPP4 alone or both together. [J6 mRNA 
was then quantified. n =4 technical replicates per group; mean +s.e.m.; 
**P <0.05 by one-way ANOVA. d, SVF cells from lean or DIO mice were 
treated with rFX or rDPP4 alone or in combination for 4h, and then Mcp1 


1.18 40.54] 2.86 + 0.16* 


and Il6 mRNA levels were assayed (n= 4 technical replicates per group; 
mean + s.e.m.; *P < 0.05 and n.s., non-significant by one-way ANOVA). 
e-j, The control and rivaroxaban-treated mice from Fig. 3b, c were 
assayed. e, Body weight. f, Adgre1, Mcp1, Il6, Tnfa and Il1b mRNA in VAT. 
g, h, Blood glucose and plasma insulin 5 h after food withdrawal. i, p-AKT 
and total AKT in VAT and liver after portal vein insulin injection. j, Plasma 
non-esterified fatty acids (NEFA). In e-h and j, n=5 mice per group; 
mean +s.e.m.; *P< 0.05 and n.s., non-significant by two-tailed Student’s 
t-test. In i, the mean fold increases of the plus-insulin values relative to 

the minus-insulin value, based on the densitometric ratio of p-AKT:total 
AKT, are shown below the blots (n = 1 PBS-injected and n= 4-insulin 
injected mice per group; mean + s.e.m.; *P < 0.05). For gel source data, see 
Supplementary Fig. 1. 
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Extended Data Figure 9 | See next page for caption. 
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Extended Data Figure 9 | DPP4 and FXa synergistically induce 
inflammatory signalling pathways in macrophages. a, BMDMs were pre- 
treated for 4h with or without 0.25 U rFXa, medium was then removed, 
cells were washed and treated with 101M rivaroxaban (riv) to inhibit 
residual FXa activity. The cells were then incubated for 4h with or without 
70 ng rDPP4 and assayed for Mcp1 and IJ6 mRNA. b, BMDMs were pre- 
treated for 4h with or without 70 ng rDPP4, medium was then removed, 
cells were washed and treated with 101.M KR62436 to inhibit residual 
DPP4 activity. The cells were then incubated for 4h with or without 0.25 U 
rFXa and assayed for Mcp1 and IJ6 mRNA. Ina and b, n= 3 technical 
replicates per group; mean + s.e.m.; *P < 0.05 versus all other groups by 
one-way ANOVA. c, ATMs were pre-treated for 1h with or without 101.M 
of the PAR2 inhibitor GB83, 251M of the CAV1 inhibitor daidzein, 101M 
of the MEK inhibitor PD98059 or 101M of the IKK inhibitor PS-1145. 
The cells were then incubated for 4h with or without rFXa and rDPP4 

and assayed for Mcp1 and Il6 mRNA. n=4 technical replicates per group; 
mean + s.e.m.; **P < 0.05 by one-way ANOVA. d, Human monocyte- 
derived macrophages were pre-treated for 1h with or without 10 1M PAR2 
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inhibitor (GB83) or 2541M CAV1 inhibitor (daidzein) and then 

incubated for 4h with or without rFXa, rDPP4 or both and assayed 

for MCP1 and IL6 mRNA. n=3 technical replicates; mean + s.e.m.; 

**P < 0.05 versus all other groups by one-way ANOVA. e-g, Bone 
marrow-derived macrophages were incubated for 10 min with or 

without 0.25 U rFXa, 70 ng rDPP4 or rFXa plus rDPP4 and then assayed 
by immunoblot for phosphorylated and total IRAK1, TAK1, RAF1, 
ERK1/2 and P65, followed by densitometric quantification. The data are 
representative of two (e, f) or three (g) independent experiments. For 

all panels, n = 3 technical replicates per group; mean + s.e.m.; *P < 0.05 
versus other groups and n.s., not significant by one-way ANOVA. 

h-j, Bone marrow-derived macrophages were pretreated for 1h with or 
without the following inhibitors. h, IRAK-1/4 inhibitor-I, 0.5 1M. i, TAK1 
MAPKKK inhibitor Z-7-oxozeaenol, 0.1 1M. j, RAF1 inhibitor GW5074, 
141M. Cells were then incubated for 4h with or without rFXa plus rDPP4 
and assayed for Mcp1 and I]6 mRNA. n= 4 technical replicates per group; 
mean + s.e.m.; *P < 0.05 by one-way ANOVA. 
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containing scrambled siRNA (control), Par2 siRNA or Cav1 siRNA. 
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The protein histidine phosphatase LHPP is a tumour 


suppressor 


Sravanth K. Hindupur!, Marco Colombi!, Stephen R. Fuhs?, Matthias S. Matter*, Yakir Guri!, Kevin Adam2, Marion Cornu!, 
Salvatore Piscuoglio’, Charlotte K. Y. Ng, Charles Betz', Dritan Liko', Luca Quagliata’, Suzette Moes', Paul Jenoe’, 
Luigi M. Terracciano*, Markus H. Heim*, Tony Hunter? & Michael N. Hall! 


Histidine phosphorylation, the so-called hidden phosphoproteome, 
is a poorly characterized post-translational modification of 
proteins)”. Here we describe a role of histidine phosphorylation 
in tumorigenesis. Proteomic analysis of 12 tumours from an 
mTOR-driven hepatocellular carcinoma mouse model revealed 
that NME1 and NME2, the only known mammalian histidine 
kinases, were upregulated. Conversely, expression of the putative 
histidine phosphatase LHPP was downregulated specifically in the 
tumours. We demonstrate that LHPP is indeed a protein histidine 
phosphatase. Consistent with these observations, global histidine 
phosphorylation was significantly upregulated in the liver tumours. 
Sustained, hepatic expression of LHPP in the hepatocellular 
carcinoma mouse model reduced tumour burden and prevented 
the loss of liver function. Finally, in patients with hepatocellular 
carcinoma, low expression of LHPP correlated with increased 
tumour severity and reduced overall survival. Thus, LHPP is a 
protein histidine phosphatase and tumour suppressor, suggesting 
that deregulated histidine phosphorylation is oncogenic. 

Liver cancer is the second leading cause of cancer-related deaths 
globally. Hepatocellular carcinoma (HCC) represents approximately 
90% of primary liver cancer cases*, Recent studies indicate that almost 
50% of HCC cases display aberrant PISK-AKT-mTOR signalling’, 
including loss of the tumour suppressors PTEN, TSC1, and TSC2? 
(Fig. 1a). We generated an HCC mouse model by liver-specific dele- 
tion of PTEN and TSC1, using the neonatally expressed albumin 
promoter as a cre driver (see Methods). We hereafter refer to this 
model as liver-specific double-knockout (L-dKO) mice. L-dKO mice 
invariably exhibited hepatomegaly and advanced liver tumours at 6 
and 20 weeks of age, respectively (Fig. 1b and Extended Data Fig. 1a). 
Histopathological analysis (Fig. 1b) and molecular classification based 
on mRNA expression of the liver cancer markers CD133 (also known 
as Prom1), CD90 (Thy1), Cd44 and Aldh1 (Aldh1a1) revealed that the 
L-dKO liver tumours indeed mimicked poorly differentiated human 
HCC (Extended Data Fig. 1b), consistent with other studies®®. 

We performed quantitative proteomic analysis on three tumours 
each from four 20-week-old, L-dKO mice (12 tumours in total; see 
Methods). Liver protein extracts from six cre-negative age- and sex- 
matched littermates were pooled in equal amounts and used as a 
control (Extended Data Fig. 1c). We identified approximately 4,500 
proteins per tumour (Extended Data Fig. 1d). Of the 3,147 proteins 
that were detected in a minimum of 10 tumours, 433 proteins were 
upregulated and 262 proteins were downregulated compared to the 
control (Fig. 1c). We manually curated a list of known or putative phos- 
phatases and kinases significantly deregulated in a minimum of ten 
tumours (Extended Data Fig. le). Particularly intriguing in this list 
were the upregulated histidine kinases NME1 and NME2 (also known 
as NDPKA and NDPKB, respectively) (Fig. 1d), and the downregulated 


putative histidine phosphatase LHPP (phospholysine phosphohistidine 
inorganic pyrophosphate phosphatase). 

LHPP has been purified from bovine liver and shown to hydrolyse 
P-N bonds in synthetic substrates in vitro’~’, but is otherwise poorly 
characterized. LHPP is evolutionarily conserved from worm to human 
(overall 28% identity) (Extended Data Fig. 2a), but only weakly homo- 
logous to the two defined mammalian histidine phosphatases PHPT 1 
(8.1% identity) and PGAM5 (6.4% identity)!°- (Extended Data Fig. 2b). 
We note that PHPT1 and PGAMS levels were unchanged in tumours 
from L-dKO mice (Fig. 1d). NME1 and NME2 are homologous 
proteins (88% identical) that function as histidine kinases as either 
homo- or hetero-oligomers. NME1 and NME2 are the only mamma- 
lian protein histidine kinases reported so far'*"!°. Immunoblot analysis 
confirmed that LHPP was downregulated and NME1 and NME2 were 
upregulated (Fig. 2a and Extended Data Fig. 3a) in L-dKO tumours, 
as initially observed by proteomic analysis. Immunoblot analysis of 
S6 protein phosphorylated at Ser240 and Ser244 (S6-pS240/244) and 
AKT phosphorylated at Ser473 (AKT-p$473) confirmed high mTORC1 
and mTORC2 activity, respectively, in L-dKO liver tissue and tumours 
(Fig. 2a, b). To determine whether the loss of LHPP expression coin- 
cided with tumorigenesis, we examined whole liver tissue (6, 12 and 
16 weeks) and isolated tumours (20 weeks) from L-dKO mice at differ- 
ent ages and hence at different stages of tumour development (Fig. 2b). 
Age-matched cre-negative littermates served as controls. LHPP expres- 
sion was mildly reduced in the livers of L-dKO mice at 16 weeks, at 
which age macroscopic tumours were first visible. As observed previ- 
ously, LHPP was strongly decreased in tumours of 20-week-old mice. 
As expected, mTOR signalling was high at all ages. Thus, the reduction 
in LHPP expression coincided with tumour development, but did not 
correlate with mTOR activity. Hepatic NME1 and NME2 expression 
decreased in control mice in an age-dependent manner, but remained 
high in L-dKO liver tissue and tumours. Thus, increased NME1 and 
NME2 expression in L-dKO mice correlated with increased mTOR 
activity rather than with tumour development (Fig. 2b), suggesting 
that mTOR may control NMEI and NME2 expression. To determine 
whether loss of LHPP expression was specific to tumours, we per- 
formed immunoblot analysis on dissected tumour and non-tumour 
liver tissue from 20-week L-dKO mice. LHPP expression was reduced 
(or absent) in the tumour, compared to the non-tumour liver tissue 
(Extended Data Fig. 3b). Immunohistochemistry on liver from 20-week 
L-dKO mice confirmed that LHPP was strongly downregulated (unde- 
tectable) in tumours compared to non-tumour tissue (Extended Data 
Fig. 3c). Like mTOR activity, NME1 and NME2 expression was simi- 
larly upregulated in both tumour and non-tumour L-dKO liver tissue 
compared to control liver tissue (Extended Data Fig. 3b, c), again indi- 
cating that the expression of NME1 and NME2 correlated with mTOR 
activity rather than tumorigenicity. These findings suggest that LHPP 
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Figure 1 | Proteomics on L-dKO tumours. a, Diagram of mTOR 
signalling showing TSC and PTEN tumour suppressors. b, Top, 
representative images of whole livers from 20-week-old L-dKO (tumours 
indicated with arrowheads) and control mice. Bottom, representative 
photomicrographs of haematoxylin and eosin (H&E)-stained liver 

from control mice, tumour (T) and adjacent non-tumour (NT) tissue 
from L-dKO mice (20 weeks). Similar results were obtained in tumours 
from nine mice. c, Number of tumour proteins detected and quantified by 
mass spectrometry. A total of 7,753 proteins were quantified in all tumours 
(n= 12). 3,147 proteins were quantified in a minimum of 10 tumours, 

of which 433 proteins were upregulated (red), 262 were downregulated 
(blue), and 2,452 were unchanged (yellow). ANOVA-based two-sample 
t-test with a false discovery rate (FDR) of 2% was used to determine 
regulation. d, Mass spectrometry-determined fold change in independent 
L-dKO tumours (n = 12 for NME1, NME2, LHPP and PHPT1; n=9 for 
PGAMS) compared to control livers (n = 6); 

data are mean +s.d. PHPT1 and PGAM5 are two defined mammalian 
histidine phosphatases. 


inactivation, in a background of constitutively high NME1 and NME2 
activity in L-dKO liver, is a key event in tumorigenesis. 

The above findings suggest that the expression of NME1 and NME2, 
but not LHPP, is controlled by mTOR. To investigate this further, we 
acutely treated (24h) L-dKO mice with the mTORC1 inhibitor rapa- 
mycin or the pan-mTOR inhibitor INK128. INK128 treatment reduced 
NME1I and NME2 expression in L-dKO tumours, but had no effect on 
LHPP (Extended Data Fig. 3d). Rapamycin had no effect on the level 
of NME1, NME2 or LHPP (data not shown). Thus, the expression of 
NME1 and NME2 seems to be mTORC2 dependent, whereas LHPP 
expression is mTOR independent. 

The paired upregulation of NME1 and NME2 (histidine kinases) and 
downregulation of LHPP (putative histidine phosphatase) prompted us 
to investigate protein histidine phosphorylation in L-dKO tumours?"®. 
Phosphohistidine (pHis) exists in two isomers, 1-pHis and 3-pHis, 
depending on the position of the phospho-acceptor nitrogen in the 
histidine imidazole ring (N1 and N3 positions, respectively)”!”. Unlike 
the phosphoester (P-O) bond in other phosphoamino acids, the phos- 
phoramidate (P-N) bond in phosphohistidine is heat- and acid-labile, 
thus making pHis difficult to detect in biological samples'®. This difficulty 
has been largely circumvented by the recent development of monoclonal 
antibodies that specifically recognize 1-pHis or 3-pHis*. The new 
antibodies were used to assess histidine phosphorylation (1-pHis and 
3-pHis) in L-dKO tumours and control liver (Fig. 3a) (see Methods). 
The immunoblot signals of both 1-pHis and 3-pHis were high in 
tumours compared to control tissue (Fig. 3a, b). Heating sample-buffer- 
solubilized lysates at 95°C for 10 min before SDS-PAGE caused 
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dephosphorylation, indicating that the signals detected in the unheated 
samples are indeed histidine-phosphorylated proteins (Fig. 3a). 
As reported previously with cultured cells”, multiple 3-pHis-positive 
proteins but only one or a few 1-pHis proteins were detected in tumours 
and liver tissue. This could be due to the lower thermodynamic stability 
of 1-pHis than of 3-pHis'®”°, or the existence of fewer proteins phos- 
phorylated at the N1 position. The major band observed with the 
1-pHis antibody is likely to be co-migrating NME1-pHis118 and 
NME2-pHis118 (Fig. 3a), because it displays an appropriate molecu- 
lar mass (18kDa) for NME1 and NME2 and was lost upon NME1 
and NME2 knockdown (data not shown). NME1 and NME2 normally 
autophosphorylate position N1 of His118, and then transfer this phos- 
phate to histidine (N1 or N3) in target proteins (Extended Data Fig. 4a). 
Thus, consistent with increased levels of NME1-pHis118 and NME2- 
pHis118 and decreased LHPP levels, histidine phosphorylation was 
significantly increased in L-dKO tumours. 

To identify proteins in which histidine phosphorylation is increased 
in tumours, we used a combination of anti-1-pHis and anti-3-pHis 
monoclonal antibodies to immunoprecipitate pHis proteins from four 
L-dKO tumours and control tissue. Subsequent mass spectrometry 
identified 1,173 and 823 proteins from tumour and control liver tissue 
lysates, respectively (Supplementary Tables 1 and 2). In total, 260 
proteins were more abundant (at least twofold) in tumours compared 
to control tissue (Supplementary Table 3). Only 18 proteins were more 
abundant in control tissue (see below for more discussion). 

We next investigated whether LHPP is indeed a protein phosphohis- 
tidine phosphatase. First, we assayed LHPP phosphohistidine phos- 
phatase activity in vitro (see Methods). Recombinant LHPP treatment 
reduced 3-pHis levels in an electrophoretically fractionated CB1 cell 
lysate transferred to a membrane (Extended Data Fig. 4b). CB] is a cell 
line derived from L-dKO tumours (see Methods) and thus displays 
high levels of histidine phosphorylation. Second, we assessed LHPP 
phosphatase activity in vivo. Restoration of Lhpp expression in CB1 
cells, using an adenovirus delivery system, reduced 3-pHis 3.1-fold with 
negligible effect on 1-pHis, pSer/Thr or pTyr (Fig. 3c, d, Extended Data 
Fig. 4c). Similar results were obtained with regard to pHis (3.9-fold 
drop in 3-pHis, no effect on 1-pHis) after introducing mouse Lhpp into 
human SNU449 cells, a liver cancer cell line with low intrinsic LHPP 
levels (Extended Data Fig. 4d, e). Immunoblot analysis confirmed the 
expression of recombinant Lhpp in both cell lines (Fig. 3c and Extended 
Data Fig. 4d). Collectively, these results indicate that LHPP is a histidine 
phosphatase broadly acting on N3-phosphorylated proteins. We note 
that PGAM5 was previously demonstrated” to be a 1-pHis phosphatase 
acting specifically on NME2-pHis1 18. 

To identify potential LHPP targets, we immunoprecipitated 
N3-phosphorylated proteins from CB1 cells lacking and re-expressing 
LHPP. Mass spectrometry revealed nine proteins that were more abun- 
dant (at least 1.7-fold) in CB1 cells that lack LHPP (Supplementary 
Table 4). These included ACLY (ATP citrate lyase), known to be 
N3-phosphorylated at His760 (see below). 

We next examined further the role of the phosphatase LHPP in pro- 
liferation and tumorigenesis. Increased expression of LHPP in CB1 
and SNU449 cells significantly impaired cell proliferation and, in CB1 
cells, reduced hepatosphere formation (Extended Data Fig. 4f, g). 
Conversely, knocking down the already low level of LHPP expression in 
SNU449 cells enhanced cell proliferation (Extended Data Fig. 5a). Next, 
we investigated the effect of LHPP expression on tumour formation 
in L-dKO mice. Hepatocyte-specific, adeno-associated virus (AAV) 
expressing either red fluorescent protein (RFP) (AAV/DJ-ALBp-REFP, 
hereafter referred to as AAV-control) or RFP and murine LHPP (AAV/ 
DJ-ALBp-RFP-2A-mLhpp, hereafter referred to as AAV-LHPP) was 
injected into the tail vein of 8-week-old L-dKO and littermate con- 
trol mice (Fig. 3e). Tumour burden was assessed at 20 weeks of age. 
Hepatic overexpression of LHPP in AAV-LHPP- infected L-dKO mice 
(20 weeks) was confirmed by immunoblotting (Extended Data Fig. 5b). 
Macroscopic and histological examination of livers revealed few and 
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Figure 2 | Loss of LHPP expression is confined to L-dKO tumours. 

a, Immunoblot analysis indicates reduced LHPP and increased NME1 
and NME2 expression in tumours compared to age-matched littermate 
control (liver samples from control mice (n = 4) and two tumour samples 
each from four 20-week-old L-dKO mice (n= 8)). For full scans, see 
Supplementary Fig. 1. b, Immunoblot analysis indicates reduced LHPP 
expression (16 and 20 weeks (w)) and increased NME1 and NME2 
expression (12, 16 and 20 weeks) in L-dKO liver (6, 12 and 16 weeks) and 
tumours (20 weeks) compared to age-matched control mice (6, 12 and 

16 weeks, n = 4, liver tissues from two mice are pooled per lane). For full 
scans, see Supplementary Fig. 2. 


small (micronodular) tumours in AAV-LHPP-infected L-dKO mice 
(Fig. 3e-g). The few ‘escaper’ tumours that arose in AAV-LHPP- 
infected L-dKO mice displayed reduced LHPP expression, again 
suggesting that LHPP downregulation is essential for tumorigenesis. 
AAV-control-infected L-dKO livers exhibited severe HCC with multiple 
large neoplastic (macronodular) tumours. Serum levels of alanine 
amino transferase (ALT), aspartate amino transferase (AST) and lactate 
dehydrogenase (LDH), well characterized indicators of liver damage, 
were also reduced in AAV-LHPP-infected L-dKO mice compared to 
AAV-control-infected mice (Extended Data Fig. 5c). No detrimental 
effect was observed in control mice injected with either AAV-control 
or AAV-LHPP (Extended Data Fig. 5c). Thus, the expression of LHPP 
prevents tumour formation and maintains liver function in L-dKO 
mice, suggesting that the protein histidine phosphatase LHPP is a 
tumour suppressor. 

We next examined NME1, NME2 and LHPP expression in human 
liver tumours, from patients with HCC. Resected HCC tissue and 
adjacent non-tumour tissue were assessed for LHPP, NME1 and 
NME2 levels by immunoblot analysis. LHPP was low in the tumour 
(HCC) compared to non-tumour tissue in several patients (Fig. 4a), 
as observed in L-dKO mice. NME1 and NME2 expression was high 
in the tumours. We note that human NME1 and NME2, unlike their 
mouse counterparts, do not co-migrate (Fig. 4a). In agreement with the 
immunoblot analysis, immunohistochemistry also showed that LHPP 
expression was low and NME1 and NME2 expression was high in HCC 
compared to adjacent non-tumour tissue from two additional patients 
(Fig. 4b). Furthermore, using a tissue microarray assay, we evaluated 
LHPP expression in tumour and matched non-tumour liver samples 
from 20 patients with HCC. Tumour sections that had greater than 70% 
tumour cells and non-tumour sections that contained greater than 70% 
normal cells were chosen for the array (see Methods). LHPP expres- 
sion was significantly downregulated in HCC compared to matched 
non-tumour tissue (Extended Data Fig. 6a—c). Similarly, on the basis 
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Figure 3 | LHPP is a protein histidine phosphatase and a tumour 
suppressor. a, b, Immunoblot analysis (a) and quantification (b), shows 
increased 3-pHis and 1-pHis signals in L-dKO tumours (20 weeks) 
compared to control (ctrl) littermates. Similar results were observed in 
tumours from L-dKO mice (n = 4). Tissue lysates in 2x sample buffer 

(pH 8.8) heated at 95°C for 10 min were dephosphorylated and served 

as a control. For full scans, see Supplementary Fig. 5. In the 3-pHis blot, 
the area enclosed in the dashed line was used for quantification (n = 4). 
Adjusted *P = 0.0460, **P =0.0051, two-sided, Holm-Sidak’s 

multiple comparisons test. Data are mean + s.d. M, molecular mass. 

c, Immunoblotting shows a reduction in 3-pHis levels in CB1 cells infected 
with adenovirus (adeno) containing LHPP, compared to cells infected with 
a control adenovirus. Similar results were observed in three biological 
experiments. For full scans, see Supplementary Fig. 7. d, Quantification of 
immunoblots indicate that 3-pHis signals are significantly reduced upon 
LHPP re-expression compared to control-infected cells (n = 3 biological 
repeats). ****P < 0.0001, two-sided ratio paired t-test. NS, not significant. 
Data are mean +s.d. e, Top, experimental timeline. AAV-control or AAV- 
LHPP was injected into the tail vein of 8-week-old L-dKO and littermate 
control mice. Bottom, representative images of whole livers showing 
reduction in L-dKO liver tumour burden after infection with AAV-LHPP 
compared to infection with AAV-control (arrowheads indicate tumours). 
Similar results were observed in five different L-dKO mice infected with 
AAV-LHPP. f, Photographs of tumours isolated from L-dKO mice infected 
with AAV-LHPP and AAV-control. g, Quantification of tumours in L-dKO 
mice showing a significant reduction in tumour burden after infection 
with AAV-LHPP (n=5 mice), compared to AAV-control (n= 6 mice). 
8 D < ().0001, two-sided unpaired t-test. Data are mean +s.d. 


of a published mRNA expression dataset from 37 patients with HCC”|, 
LHPP mRNA levels (but not PHPT1 or PGAM5 mRNA levels) were 
significantly reduced in tumour tissue compared to matched non- 
tumour tissue (Extended Data Fig. 6d). We note that, in L-dKO mice, 
reduced LHPP protein expression also correlated with reduced Lhpp 
mRNA levels (Extended Data Fig. 1f). Finally, consistent with increased 
levels of NME1 and NME2 and decreased levels of LHPP (Fig. 4c), we 
observed significantly higher 1-pHis and 3-pHis signals in human HCC 
compared to matched non-tumour tissue (Fig. 4c, d). Collectively, the 
above findings suggest that histidine phosphorylation is upregulated 
in both mouse and human tumours. 

We next investigated clinical correlates of LHPP expression. HCC 
is scored based on Edmondson grades I to IV in which grade IV is 
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Figure 4 | Loss of LHPP expression in human HCC correlates with 
reduced patient survival. a, b, Immunoblot (” = 3 patients) (a) and 
immunohistochemistry (b) (n = 2 patients) analyses showing reduced 
LHPP and increased NME1 and NME2 expression in liver tissue from 
patients with HCC compared to adjacent non-tumour liver tissue in a total 
of n=7 HCC patients (m = 2 other patients in panel c). For full scans, see 
Supplementary Fig. 12. c, Immunoblot analysis showing increased 3-pHis 
and 1-pHis signals in human HCC compared to adjacent non-tumour 
tissue. Similar results were observed in tumours from two additional HCC 
patients (total n = 4 patients). For full scans, see Supplementary Fig. 13. 
Tissue lysates in 2x sample buffer (pH 8.8) heated at 95°C for 10 min to 
dephosphorylate pHis served as control. The monoclonal antibodies 

used to detect 3-pHis and 1-pHis are SC44-1 and SC1-1, respectively. 

d, Quantification of immunoblot (from c and two other western blots, 

n=4 in total) indicates that 3-pHis and 1-pHis signals are significantly 
higher in human HCC compared to adjacent non-tumoral tissue (band 
intensities in each lane are normalized to the intensity of corresponding 
calnexin protein levels after subtracting the intensities from corresponding 
heated lane). The y axis represents band intensities. *P = 0.0368, 

** P = 0.0064, two-sided ratio paired t-test. Data are mean +s.d. 

e, f, Kaplan-Meier curves, log-rank test, showing the correlation 

between LHPP expression and clinical outcome, as analysed for disease- 
free survival (e) and overall survival (f). 


the most severe or aggressive. Examination of a previously published 
mRNA dataset”! revealed that low LHPP expression is associated with 
more severe tumours (Edmondson grades III/IV versus I/II) (Extended 
Data Fig. 6e). These results, combined with our earlier observa- 
tion that LHPP expression in L-dKO mice reduced tumorigenicity 
(Fig. 3e-g), suggest that low LHPP expression could correlate with 
reduced patient survival. We referred to The Cancer Genome Atlas 
(TCGA) to determine whether there is a correlation between LHPP 
expression and overall survival or disease-free survival (see Methods). 
LHPP downregulation was associated with faster disease progression 
and worse overall survival in 319 and 370 patients with HCC, respec- 
tively (Fig. 4e, f). Median disease-free survival and overall survival 
of patients with HCC decreased from 25 to 12 months and from 61 
to 38 months, respectively, after the loss of LHPP expression. Thus, 
decreased LHPP expression in human HCC is associated with increased 
histidine phosphorylation and reduced survival. 

We identified 49 LHPP mutations in datasets from TCGA and the 
International Cancer Genome Consortium (ICGC). Twelve (24.5%) of 
the mutations, found in oesophagus (2), head and neck (2), stomach 
(2), bladder (1), breast (1), skin (1), liver (1), lung (1), and pancreas 
(1), are predicted to be inactivating mutations (Extended Data Fig. 6f), 
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that is, nonsense, frameshift or splice site mutations (Supplementary 
Table 5). The >20% prevalence of inactivating mutations suggests that 
LHPP is a tumour suppressor gene’. This suggestion is strengthened 
by two genome-wide association studies (GWAS) indicating that the 
LHPP locus is associated with oral cancer, pharyngeal cancer and 
acute lymphoblastic leukaemia”**. Whole-genome sequencing pre- 
viously identified the LHPP locus as a risk factor for major depressive 
disorder?>”®. These observations suggest that loss of LHPP and thus 
increased histidine phosphorylation is oncogenic and possibly also 
involved in other disorders. Importantly, the results collectively suggest 
that LHPP is a histidine phosphatase and tumour suppressor. 

How might histidine phosphorylation be related to cancer? The 
histidine-phosphorylated proteins we detected preferentially in 
tumours include the crucial DNA replication factors MCM3, MCM6 
and MCM7, the p53 inhibitor AROS, the cytidine deaminase ABEC3 
that converts cytosine to uracil and may induce mutations, the receptor- 
mediated endocytosis and NOTCH signalling protein AAK1, the 
mTORCI effector RSK2?’, and CHTOP. The potential LHPP targets 
we detected that might be important for tumorigenesis include the 
essential fatty acid biosynthetic enzyme ACLY*, the proto-oncoprotein 
and RNA polymerase-associated protein LEO17%, a variant histone 
H2A commonly expressed in HCC?’, and the inflammation-related 
and HCC-promoting HMGB1 protein* (Supplementary Table 4). 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Animals. Liver-specific Tsc1 and Pten double-knockout mice were obtained by 
crossing Tsc1!°*! mice (exons 17 and 18)*! with Pten!*!°* mice (exons 4 and 
5)? to transgenic mice expressing Cre recombinase under the control of the 
hepatocyte-specific albumin promoter (Alb-CreTg/0)*°, to generate liver-specific 
double-knockout (Tsc1!""**Pten'*/"* Alb-CreTg/0, in short L-dKO) mice. Age- 
and sex-matched littermate Tsc1!/"°*Pten!*“"** mice without the cre genes were 
used as controls in all animal experiments. No other randomization protocol or 
blinding was applied beyond this criterion. The mice produced were on mixed 
genetic background (C57BL/6], 129/SvJae, BALB/cJ). Mice were housed under 
temperature and humidity-controlled conditions, in a 12-h light/dark cycle with 
lights switched on between 06:00 to 18:00. All experiments were conducted on male 
mice (age between 6 and 20 weeks). In all experiments, mice were fasted overnight 
before euthanasia by CO inhalation. All relevant ethical regulations were followed 
in this study. The tumour harvest time limit was set at 20 weeks. In addition, the 
clinical conditions of mice were monitored according to the score sheet as stipu- 
lated by the Institutional Animal Care and Use Committee (IACUC), Kantonales 
Veterinaeramt of Kanton Basel-Stadt. None of the guidelines was exceeded in any 
experiment performed. For all animal experiments, respecting the animal ethical 
rules, a minimum number of animals required to get a statistically meaningful 
result were used. All animal experiments were performed in accordance with the 
federal ethical guidelines and were approved by the Kantonales Veterinaeramt of 
Kanton Basel-Stadt. 

Mass spectrometry. Sample preparation for proteome analysis of tumours. Tumours 
from 20-week-old L-dKO mice and whole liver from littermate control mice were 
carefully dissected under magnifying lens and snap-frozen in liquid nitrogen. 
Frozen tissue was pulverized in a metal plate cooled on dry ice and was trans- 
ferred into a cooled microcentrifuge tube. The powdered tissue was resuspended 
in 8 M urea (Applichem, A1086) (containing 50 mM Tris-HCl (pH 8.0), 150 mM 
NaCl, 1mM PMSE, 1x Complete Mini Protease Inhibitors (Roche),1 x PhosSTOP 
(Roche)) and homogenized using a Polytron (PT 10-35 GT) at 500g for 2 min. 
The sample was placed on ice during this process. Lysates were then subjected to 
rotation at 4°C with regular vortexing. Lysates were then centrifuged at 10,000g 
for 10 min at 4°C to remove cell debris. Protein concentration in the supernatant 
was determined using Bradford assay. Equal amounts (5 mg each) of ‘pooled con- 
trol and tumour tissue were reduced with 2.5 mM dithiothreitol (DTT) (Sigma) 
for 45 min at 56°C, cooled for 5-10 min at room temperature and alkylated with 
7.5mM iodoacetamide (IAA) (Sigma) for 45 min at room temperature (reaction 
protected from light). The urea concentration was reduced to 4M by diluting 
with 25 mM Tris-HCl, pH 8.0. Proteins were treated with endoproteinase LysC 
(1:100 (w/w), Wako) at 37°C for 2h. The urea concentration of the LysC digest was 
reduced to 1 M. Later, trypsin (Worthington) digestion (1:100; w/w) was performed 
for 2h at 37°C, followed by a second round of trypsin digestion (1:100; w/w) over- 
night at 37°C. Digestion was stopped by adding 0.4% trifluoroacetic acid (TFA) 
(Applied Biosystems) (v/v) that lowered the pH of the solution to below pH 2.0. 
Later, the digest was centrifuged at 10,000g for 10 min at room temperature and the 
supernatant was desalted on a C18 reverse-phase SepPak-200mg column (Waters). 
First, the cartridge was primed with 2 ml 100% acetonitrile (ACN) (Biosolve), 
followed by 4 ml of 50% acetonitrile containing 0.5% acetic acid (Merck) and 
equilibrated with 4ml 0.1% TFA. Peptides were loaded twice in 0.4% TFA and 
the cartridge was washed with 10 ml of 0.1% TFA. Later, the cartridge was washed 
with 0.5% acetic acid and the peptides were eluted with 4 ml of 80% ACN, 0.5% 
acetic acid. The peptides were dried in a SpeedVac and subjected to strong cation 
exchange chromatography (SCX). SCX fractionation was performed according 
to*4 with a few modifications detailed below. Storage buffer was removed from a 
HiTrap SP cartridge (GE Healthcare, 17115101) and washed twice with 2 ml of SCX 
buffer A (5 mM KH>PO, (pH 2.65), 30% can). Later, the cartridge was equilibrated 
twice with 2 ml of SCX buffer B (5 mM KH»PO, (pH 2.65), 30% ACN containing 
500mM KC), and the cartridge was re-equilibrated with 2 ml of SCX buffer A. The 
peptides were resuspended in 1 ml of SCX buffer A, centrifuged at 10,000g for 2 min 
to remove any unwanted debris. The peptides were then loaded with a syringe 
onto the HiTrap SP cartridge. The flow-through was then collected. The bound 
peptides were then desorbed in a stepwise manner: with 1 ml each of SCX buffer 
A containing 50 mM, 100mM, 150mM, 250 mM and 350 mM KCL. Each fraction 
was collected and the peptide concentration was estimated at 280 nm. The fractions 
were dried in a SpeedVac and peptides were then desalted on Microspin cartridges 
(Vydac). 101g of each fraction was removed for liquid chromatography-tandem 
mass spectrometry (LC-MS/MS proteome analysis). 

LC-MS/MS analysis. Dried peptides were dissolved in 20 1l of 0.1% acetic acid, 
0.005% TFA. The peptides were analysed by capillary LC-MS/MS using a home- 
made separating column (0.075 mm x 18 cm) packed with Reprosil C18 reverse- 
phase material (2.4j1m particle size). The column was connected on line to an 
Orbitrap FT hybrid instrument (Thermo Scientific). Solvent A (0.1% acetic acid 
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in water, 0.005% TFA) and solvent B (0.1% acetic acid, 0.005% TFA and 80% ace- 
tonitrile in water) were used for peptide separation. 211 of peptide digest were 
injected with a Proxeon Easy-LC capillary pump (Thermo Scientific) set to 
0.31 min”. A gradient from 0 to 40% solvent B in solvent A in 190 min was 
delivered with the nano pump at a flow rate of 300 nlmin~!. The percentage of 
solvent B was increased to 75% in 10 min at the end of the gradient. Later, the 
eluting peptides were ionized at 2.5kV and mass spectrometer was operated in 
data-dependent mode. The precursor scan was done in the Orbitrap set to a 60,000 
resolution, while the fragment ions were mass analysed in the LTQ instrument. 
A top twenty method was used for fragmentation. 

Protein identification. The LC-MS/MS data were searched with Proteome 
Discoverer 1.4 (Thermo Scientific) set to Mascot and Sequest HT search engines 
with 10 ppm precursor ion tolerance. The fragment ions were set to 0.6 Da 
tolerance. Medium confidence (5% FDR) was used for peptide search. Swiss-Prot 
KB database set to M. musculus was used for Mascot. For Sequest HT, an inhouse 
databank was constructed by extracting M. musculus entries from UniProtKB. 
Quantification. The raw files from the LC-MS/MS were first passed through 
Progenesis QI software (Nonlinear Dynamics, Waters) and the matched SCX 
fractions between ‘pooled control and tumours were aligned. A minimum of 75% 
alignment was set for analysis. During alignment, the search identifications from 
Proteome Discoverer (version 1.4) were imported into Progenesis for matching 
the aligned features with the identified peptides. 

Statistical analysis. R-based Perseus program, version 1.4.0.2, was used for statis- 
tical data analysis. An ANOVA-based two-sample t-test was performed, adjusting 
SO to 1, the number of randomizations to 250 (default), and the FDR to 5% for the 
proteome**®, *P < 0.05, **P< 0.01, ***P< 0.001, ****P< 0.0001. 

p-His antibody immunoprecipitation of proteins for mass spectrometry. Tissue 
and cell lysis for pHis immunoprecipitation. Tumours from 20-week-old L-dKO 
mice and whole liver from littermate control mice were carefully dissected under 
magnifying lens and snap-frozen in liquid nitrogen. Frozen tissue was pulverized 
ina metal plate cooled on dry ice and was transferred into a cooled microcentrifuge 
tube. The powdered tissue was resuspended in 25 mM Tris-HCl (pH 8.5); 100 mM 
NaCl, pH: 8.5; 1mM PMSF, 1x Complete Mini Protease Inhibitors (Roche), 
1x PhosSTOP (Roche) and homogenized using a Polytron (PT 10-35 GT) at 500g 
for 2 min. Precautions were taken to preserve pHis (pH 8.5 and constant 4°C) and 
reagents (such as detergents) that can interfere with mass spectrometry. The sample 
was placed on ice during this process. Lysates were then incubated at 4°C on an 
end-over-end rotator for 2 h with regular vortexing. Lysates were then centrifuged 
at 10,000g for 10 min at 4°C to remove cell debris. Protein concentration in the 
supernatant was determined using BCA assay. Tissue lysate from four different 
littermate control animals was pooled to obtain a pooled control. For immuno- 
precipitation of pHis proteins from CB1 cells with and without LHPP, 2 mg for 
each condition was used. 

Bead preparation and preclearing of the lysate. One millilitre of protein A Sepharose 
beads (Thermo, 101042) was mixed with 1 ml of lysis buffer (25 mM Tris-HCl 
(pH 8.5); 100mM NaCl, pH: 8.5; 1mM PMSF, 1x Complete Mini Protease 
Inhibitors (Roche), 1x PhosSTOP (Roche)) and washed twice (by spinning at 
2,000g, 4 min, 4°C) and resuspended in 2 ml of lysis buffer. For pre-clearing, the 
protein lysate (10 mg for tissues, 2 mg for cells) was incubated with 200 11 of pre- 
pared beads and incubated at 4°C on an end-over-end rotator for 2h. Later, the 
lysate was spun (2,000g, 4min, 4°C) and the precleared lysate (supernatant) was 
divided into two different tubes (5 mg each for tissues, 1 mg each for cells). 
Antibody-lysate incubation. For tissue lysates, both 1-pHis and 3-pHis (clones 
SC1-1 and SC44-1) antibodies were used. For cell lysates, only 3-pHis antibody 
was used. Lysates were incubated with pHis antibody (30 1g of 3-pHis plus 101g 
of 1-pHis for tissue lysate; 30 1g of 3-pHis for cell lysate) or IgG antibody (40 1g 
for tissue lysate and 30 1g for cell lysate). Lysates were incubated at 4°C on an 
end-over-end rotator for 2h. Later, 200 il of beads in lysis buffer were added and 
incubation continued at 4°C on an end-over-end rotator for 12h. 

Washing the beads and elution of the proteins. The column was washed four times 
with 2 ml of lysis buffer (pre-cooled) at 4°C. After the third wash, the fourth 
wash was performed in a new microcentrifuge tube to avoid any contamination 
from proteins attached to the wall of the microcentrifuge tube. Proteins were 
eluted in two fractions (El and E2) with 2x 600,11 100 mM triethylamine (TEA), 
pH 11. The fractions were dried in a SpeedVac and resuspended in 50 mM 
Tris-HCl (pH 7.5). 

Sample preparation. The eluate in 50 mM Tris-HCl was treated with endoproteinase 
LysC (1:100 (w/w), Wako) at 37°C for 2h. Later, trypsin (Sigma) digestion (1:100; 
w/w) was performed for 2h at 37°C (pH 8.0), followed by a second round of 
trypsin digestion (1:100; w/w) overnight at 37°C. Digestion was stopped by adding 
0.4% TFA (Applied Biosystems) (v/v) that lowered the pH of the solution to below 
pH 2.0. Later, the digest was centrifuged at 10,000g for 10 min at room temperature 
and the supernatant was desalted on Microspin cartridges (Vydac). 
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First, the cartridge was activated with 1001] methanol, followed by 10011 80% 
ACN, 0.5% acetic acid and the column was equilibrated with 2 x 100,11 0.1% TFA, 
1% ACN. Peptides were loaded twice in 0.4% TFA and the cartridge was washed 
five times with 10011 0.1% TFA, 1% ACN. Later, the peptides were eluted with 
2 x 10011 80% ACN, 0.5% acetic acid. The peptides were dried in a SpeedVac 
and subjected to LC-MS/MS proteome analysis. For further details, please see the 
‘LC-MS/MS analysis’ section above. 

Protein identification. The LC-MS/MS data were searched with MaxQuant*’, 
version 1.5.3.8 and searched against the mouse Swiss-Prot database 2017-05-09. 
A mass tolerance of 20 p.p.m. was allowed for the first search, and a mass 
tolerance of 10 p.p.m. was allowed for the main search. Two missed cleavages 
were allowed. Peptide FDR, protein FDR, and site decoy fraction were set to 
2%. For protein quantification, a ratio count of 1 was applied. The fragment 
ions were set to 0.6 Da tolerance. Medium confidence (5% FDR) was used for 
peptide search. 

Quantification. The protein groups datasets were exported into a FileMaker 
Pro-12 databank. 

Identification of proteins with histidine phosphorylation in tumours. Samples 
were run in triplicates. Label-free quantification (LFQ) intensities were obtained 
for each protein from MaxQuant. A mean of LFQ intensities was calculated and 
used for all further calculations. Proteins without any ratio owing to no detection 
in either of the samples were also included in the analysis. Mass spectrometry 
analysis identified 3,265 proteins in the tumour immunoprecipitate from a total of 
4 different L-dKO tumours. A total of 1,508 proteins were at least twofold-enriched 
in the pHis immunoprecipitate compared to the control IgG immunoprecipitate 
in more than 1 tumour. Out of these, 1,173 proteins were detected and enriched 
(minimum twofold) in 2 out of 2 tumours (100%), or 3 out of 3 tumours (100%), 
or 3 out of 4 tumours (75%, 100%) (Supplementary Table 2). 

Identification of proteins with histidine phosphorylation in control tissue. Mass 
spectrometry analysis identified 3,011 proteins in the control tissue immunopre- 
cipitate from a total of 4 different control animals. A total of 1,141 proteins were 
at least twofold-enriched in the pHis immunoprecipitate compared to the control 
IgG immunoprecipitate in more than 1 tumour. Out of these, 823 proteins were 
detected and enriched (minimum twofold) in 2 out of 2 control tissues (100%), 
or 3 out of 3 control tissues (100%), or 3 out of 4 control tissues (75%, 100%) 
(Supplementary Table 1). 

Identification of proteins with increased histidine phosphorylation in 
tumours. Out of the proteins detected in tumour (1,173) and control (823) tis- 
sue, 236 proteins were detected and enriched (minimum twofold) in 2 out of 2 
experiments (100%), or 3 out of 3 experiments (100%), or 3 out of 4 experiments 
(75%, 100%) (Supplementary Table 3). A total of 24 proteins that are exclusively 
detected in tumour tissue and hence had no ratio are indicated in the list shown 
in Supplementary Table 3. 

Identification of proteins with decreased histidine phosphorylation in CB1 cells 
upon LHPP re-expression. Mass spectrometry analysis identified 295 proteins in 
the CB1 immunoprecipitate from a total of 5 different experiments. Nine proteins 
were at least 1.7-fold enriched in the CB1 cells lacking LHPP compared to the 
control IgG immunoprecipitate and LHPP re-expressing cells in minimum 50% 
of the experiments (Supplementary Table 4). 

Antibodies. PTEN (CST, 9188), TSC1 (Bethyl, A300-316A), phospho-S6 (CST, 
4856), total S6 (CST, 2217), pAKT $473 (CST, 9271), total AKT (CST, 9272), 
Calnexin (Enzo, ADI-SPA-860), NME1 (CST, 3345), NME2 (Abcam, ab60602), 
and LHPP (Novus, NBP1-83273) were obtained commercially. Phospho-histidine 
monoclonal antibodies against N1 and N3 positions were as described previously’. 
The monoclonal antibodies used to detect 3-pHis and 1-pHis were SC44-1 and 
SC1-1, respectively. Horseradish peroxidase (HRP)-coupled anti-mouse (115-035- 
774) and anti-rabbit (211-032-171) secondary antibodies were purchased from 
Jackson laboratories. 

Immunoblot analysis. For regular immunoblot analysis, the liver tissue was 
homogenized in T-PER (ThermoFisher scientific, 78510) supplemented with 1mM 
PMSE, 1x Complete Mini Protease Inhibitors (Roche), 1x PhosSTOP (Roche) 
using a Polytron (PT 10-35 GT) at 500g for 2 min. Equal amounts of homogenate 
were SDS-PAGE fractionated and transferred onto a nitrocellulose membrane 
that was incubated, after blocking (5% BSA in TBST), with appropriate antibodies. 
For phosphohistidine analysis, tissues (equal wet weights) and equal number of 
cells were lysed directly in 2x pHis sample buffer (5x pHis sample buffer: 250 mM 
Tris-HCl pH 8.8, 0.02% bromophenol blue, 50% glycerol, 50 mM EDTA, 500 mM 
DTT and 10% SDS), incubated on ice and sonicated (15s x 3 times, Hielscher 
VialTweeter) to disrupt cells and shear DNA. Lysates were clarified by centrifu- 
gation (10,000g for 15 min at 4°C). Fresh lysates were resolved immediately using 
polyacrylamide gels at 4°C with a modified stacking gel, pH 8.8 and 10% resolv- 
ing gel, pH 8.8. All electrophoresis and protein transfer was strictly carried out 
at 4°C. In addition, the running and transfer buffers were precooled before use. 


The nitrocellulose membranes were blocked in casein blocking buffer (0.1% casein, 
10% PBS, pH 8.8). Primary antibodies were resuspended in casein blocking buffer 
with 0.1% Tween-20, incubated overnight at 4°C. The monoclonal antibodies used 
to detect 3-pHis and 1-pHis were SC44-1 and SC1-1, respectively at a dilution of 
0.5j1g ml~!. The membranes were washed three times (15 min per wash at 4°C) 
with TBST (modification, pH 8.8) and incubated with anti-rabbit HRP conjugated 
secondary antibody (1:10,000) at 4°C. Lysates in sample buffer (pH 8.8) heated 
at 95°C for 10 min to dephosphorylate pHis served as a control. The bands in the 
heated lane are either heat-resistant pHis bands or a non-specific signal. For quan- 
tification of the pHis immunoblots, band intensities in each lane are normalized to 
intensity of corresponding calnexin protein levels after subtracting the intensities 
from corresponding heated lane; the y-axis represents band intensities. 
Immunoblot analysis on Lhpp adenovirus-infected cells. A 50-60% conflu- 
ent dish was first infected, 24h after infection, the cells (CB1 or SNU449) were 
trypsinized and 2 x 10° cells were seeded into a 6-well adherent tissue culture dish 
and 24h later lysed in pHis compatible buffer (2 pHis sample buffer, for buffer 
composition, see the ‘Immunoblot analysis’ section above). 

Crystal violet staining. On Lhpp-adenovirus infected cells. Twelve hours after 
infection, CB1 (2 x 10‘ per well) and SNU449 (4 x 10+ per well) cells were 
seeded into a 6-well plate and stained with crystal violet (2% crystal violet in 
20% methanol) on day 7 (n= 4). The plate was left to dry at room temperature. 
Absorbance was measured at 560 nm on a Tecan Infinite M1000 machine. The 
software used to control was Tecan i-control, version 1.11.1.0, according to the 
manufacturer's instructions. Reference (or blank) was set with an empty 6-well 
plate. Multiple reads were read per well (15 x 15 reads) and a border of 3,000 1m 
was left in each well. Number of flashes was set to 25 with 10 ms of settle time. 
On LHPP-siRNA-transfected cells. LHPP siRNA (human LHPP ON-TARGET 
plus set of 4, pooled, catalogue number LQ-018950-02-0005; siRNA 1 
CAACCCAAACUGUGUGGIUA, siRNA 2 CAUGAAGGCGCUUGAGUAU, 
siRNA 3 GCAGCACGCCGACAAGUGA, siRNA 4 CUGAAGCGUUCCCG 
GCUGA) and negative control siRNA (ON-TARGET plus non-targeting pool) 
were purchased from Dharmacon. SNU449 cells were transfected in the presence 
of the transfection reagent jetPRIME (PolyPlus transfection) as per the manufac- 
turer’s instructions. Cells were then re-suspended in complete DMEM plus 10% 
FBS medium. Then 36h after transfection, SNU449 cells were re-transfected with 
LHPP siRNA and were seeded in a 6-well plate (4 x 10 per well). The cells were 
stained with crystal violet (2% crystal violet in 20% methanol) on day 3. 

Cell culture. CB1 cells were established from 20-week-old L-dKO tumours. 
Multiple L-dKO tumours were chopped finely into small pieces and washed in 
warm 20 ml PBS. The processed tumour tissues were seeded in a 6-well tissue 
culture dish in DMEM with 10% FBS. Six different clones, CB1—CB6 were selected 
by limited trypsinization and amplified in DMEM with 10% FBS. Clone CB1 
(transformed in vivo) was used for all future assays. SNU449 cells were obtained 
from ATCC and grown in DMEM with 10% FBS. We did not authenticate the 
cell line in our laboratory. The CB1 cell line was authenticated as hepatocytes 
by probing for the markers TSC1 and PTEN that were knocked out only in 
hepatocytes. CB1 and SNU449 cell lines were tested and found to be free of myco- 
plasma contamination. 

In vitro phosphatase assay. CB1 cells were lysed in 2 sample buffer (pH 8.8) and 
electrophoresed in two sets using a 10% modified SDS gel (see the ‘Immunoblot 
analysis’ section above). The proteins were transferred onto a nitrocellulose 
membrane. After transfer, the membrane was cut into two halves. One half of 
the membrane was treated with 0.5 1g of recombinant LHPP (Abcam, ab116175) 
in 500,11 Tris-MgCl, buffer (20 mM Tris-HCl, pH 8.0, and 5mM MgCly). The 
other half of the membrane was treated with 1 ml Tris-MgCl, buffer. The assay 
was carried out at room temperature. After 4h of treatment with LHPP, the blots 
were washed in casein blocking buffer (pH 8.8) and immunoblotted with 3-pHis 
antibody. The monoclonal antibody used to detect 3-pHis was SC44-1. 

INK128 treatment and AAV administration. For acute INK128 administration, 
20-week-old L-dKO (n =8) and control animals (n = 8) were injected with INK128 
(1mg kg! of body weight) or with the vehicle (intraperitonial injection). INK128 
was dissolved in 5% 1-methyl-2-pyrrolidionone, 15% polyvinylpyrrolidone K30, 
and 80% water. INK128 injection was performed twice (06:00 and 18:00) over 24h. 
For AAV administration, 8-week-old control and L-dKO mice were infected with 
AAV/DJ-ALBp-REP (AAV-control) or AAV/DJ-ALBp-RFP-2A-mLhpp (AAV- 
LHPP) (5 x 101! genome copies per mouse) via tail vein injection. 
Hepatosphere assays. CB1 cells (2 x 10° cells per well) were seeded in 6-well 
ultra-low attachment plates (Corning, 3471, Costar 6 Well Plate with lid) in 
serum-free DMEM/F12 with recombinant murine epidermal growth factor 
(20ng ml~!; R&D Systems), murine hepatocyte growth factor (Sigma-Aldrich), 
and 1x B27 serum-free supplement (Invitrogen). Cells were cultured at 37°C, 
in 5% CO. On the fifth day, hepatospheres measuring greater than 100}1m were 
counted for the assay. 
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Histopathology and immunohistochemistry. After euthanization of mice, 
the livers were fixed in 4% paraformaldehyde, dehydrated and embedded into 
paraffin wax blocks. Embedded-tissues were cut into 4-\1m-thick sections 
placed on SuperFrost slides (Thermo Scientific) and stained with H&E (Merck). 
Imunohistochemistry was performed using the antibodies NME1 (CST, 3345, dilu- 
tion 1:100), NME2 (Abcam, ab60602, 1:100), and LHPP (Novus, NBP1-83273, 
1:200). 

HCC tissue specimens and tissue microarrays. Formalin-fixed and paraffin- 
embedded tumour specimens, as well as fresh frozen tissue were obtained from 
the Institute of Pathology, University of Basel, Switzerland. For tissue microarray 
construction, a representative tumour area was selected on an H&E-stained slide 
of the donor block. Afterwards, a core punch with a diameter of 0.6 mm was taken 
from the tumour, and in selected cases from the non-tumoral liver tissue. Core 
punches were transferred to a new paraffin recipient block by using a programmed 
tissue arrayer (Beecher Instruments). The tissue microarray contained 30 HCC 
samples and 30 non-tumour liver control samples. Microarrays were cut in sections 
of 441m thickness and stained with a polyclonal LHPP antibody diluted at 1:200 
using an automatic Benchmark XT staining machine (Ventana Medical Systems 
Inc.). LHPP staining intensity was evaluated by a clinical pathologist (M.S.M,) and 
graded semiquantitatively into: 0 for negative staining, 1 for weak positive staining, 
2 for moderate positive staining and 3 for strong positive staining (Extended Data 
Fig. 6b). Ten pairs of samples in which one of the tissue sections was washed away 
during staining were ignored during evaluation. While scoring for staining inten- 
sity, the pathologist was double blinded to the antigen that was probed and the 
sample identity. 

Re-analysis of transcriptomic profiling data. LHPP expression was evaluated 
in 59 HCC needle biopsies, 37 of which were matched with their corresponding 
non-neoplastic liver parenchyma (cirrhotic tissues), using published transcrip- 
tomic data (GSE64041)*!. CEL files were normalized using the Qlucore Omics 
Explorer software (Qlucore AB). LHPP expression was extracted for each sample. 
RNA sequencing gene expression data including outcomes from 370 hepatocel- 
lular carcinomas were retrieved from The Cancer Genome Atlas dataset (TCGA, 
Provisional) using the cbioportal (http://www.cbioportal.org, accessed 4 November 
2016) website. Downregulation of LHPP was defined as z-score < —1. 

LHPP mutation analysis. We interrogated the cBioPortal**° and ICGC data 
portal (https://dcc.icgc.org) for deletions at the LHPP locus in samples across 
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33 cancer types. We identified 32 cases harbouring deletion at the LHPP locus. 
Furthermore, we identified 49 mutations across the datasets. We removed duplicate 
TCGA samples that appear in both data portals and variants with >1% population 
frequencies in the Exome Aggregation Consortium data set, ExAC”. 12 out of 
49 (24.5%) of the mutations are inactivating mutations (Supplementary Table 5). 

Serum analysis. Liver damage enzymes ALT, AST and LDH were measured using 
a biochemical analyser (Cobas c 111 analyser, Roche). 

Patient material and ethics. All relevant ethical regulations were followed in this 
study. All the experiments with human tissue samples reported in this study were 
approved by the ethics commission of Northwestern Switzerland (EKNZ, approval 
No. 361/12). An informed consent was obtained from the human subjects. 

Data availability. All data generated during this study are included in this paper 
and its figures, Extended Data and Supplementary Information. For gel source data, 
see Supplementary Figs 1-13. In addition, publicly available data were analysed in 
this study and are cited wherever relevant. 
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Extended Data Figure 1 | Proteomics on L-dKO tumours. a, Liver weight 
(LW) to body weight (BW) ratio (percentage) of 20-week-old L-dKO 

mice (n= 9) and age-matched control mice (n= 9). ****P < 0.0001, 
two-sided unpaired t-test. b, mRNA expression analysis in 20-week-old 
L-dKO tumours (1 = 3) compared to livers from age-matched control mice 
(n=3). CD90 (also known as Thy1; *P = 0.0104), CD133 (also known 

as Prom1; *P = 0.0227), Cd44 (*P = 0.0473) and Aldh1 (also known as 
Aldh1a1; *P = 0.0107), two-sided unpaired t-test. Data are mean +s.d. 

c, Diagram showing the scheme used for mass spectrometry-based 
proteomic analysis. Hepatic proteomes from control mice (n = 6) were 


combined and used as a pooled control against tumours (n= 12) from four 
different L-dKO mice. d, Graph showing number of proteins upregulated 
(red), downregulated (blue) and not regulated (grey) in 12 tumours from c. 
ANOVA-based two-sample t-test with a FDR of 2%, SO = 1 was used 

to determine regulation (n = 12). e, List of kinases and phosphatases 
(including putative phosphatases) regulated in a minimum of 10 out of 

12 tumours. f, Lapp mRNA expression analysis in 20-week-old L-dKO 
tumours compared to livers from age-matched control mice (n = 4). 
P=0.0022, two-sided unpaired t-test. Data are mean + s.d. 
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Extended Data Figure 2 | LHPP protein is conserved in eukaryotes. a, LHPP protein is highly conserved in eukaryotes, from worm to human. 
b, Human LHPP protein shows weak homology to known human histidine phosphatases PHPT1 and PGAMS. 
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Extended Data Figure 3 | See next page for caption. 
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Extended Data Figure 3 | Loss of LHPP expression is confined to 
L-dKO tumours. a, Quantification of immunoblot (from Fig. 2a) indicates 
reduced LHPP (***P= 0.0004) and increased NME1 (*P=0.0254) and 
NME2 (*P=0.0105) expression in tumours compared to age-matched 
control littermates (band intensities in each lane are normalized to 
intensity of corresponding calnexin protein levels), control mice (n= 4), 
L-dKO tumours (n = 8). P values are from a two-sided unpaired t-test. 
Data are mean + s.d. b, Immunoblot analysis of liver tumour (n = 6) 

and non-tumour (n = 3) tissue from three independent 20-week-old 
L-dKO mice shows tumour-specific loss of LHPP expression. Ponceau S 
image of the whole blot is also included. For full scan, see Supplementary 
Fig. 3. c, Immunohistochemistry analyses showing increased NME1 
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expression in L-dKO liver compared to liver from control mice. Although 
L-dKO tumours displayed reduced expression of LHPP, hepatic LHPP 
expression in the non-tumour region was comparable to control mice. 
Parallel tissue sections that were processed similarly (without primary 
antibody) served as antibody control. Similar results were obtained in 
livers from 4 independent control mice and L-dKO mice. d, Immunoblot 
analysis of liver from control mice and L-dKO liver tumours treated with 
INK128 (1 mg kg“! body weight; two intraperitonial injections in 24h; 
n=4 mice per condition, lysates from two mice are pooled into one lane). 
LHPP expression remains unchanged, but NME1 and NME2 expression 
is reduced considerably in L-dKO mice after INK128 treatment. For full 
scans, see Supplementary Fig. 4. 
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Extended Data Figure 4 | See next page for caption. 
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Extended Data Figure 4 | LHPP is a protein histidine phosphatase. 

a, Diagram of NME1 and NME2 autophosphorylation on His118. The 
high-energy phosphate is later transferred from the histidine kinase 
dimer pair to the histidine on the substrate. b, CB1 cell lysate was 
electrophoresed and transferred onto a nitrocellulose membrane. Two sets 
of the membrane were treated with recombinant full-length LHPP (for 
4h) in TMD buffer (see Methods) and the other set of the membrane was 
treated with plain TMD buffer (pH 8.5). Immunoblotting analysis of both 
the membranes with 3-pHis antibody (clone SC44-1) indicates that LHPP 
is a protein pHis phosphatase. For full scans, see Supplementary Fig. 6. 

c, Immunoblotting shows no apparent change pSer/Thr or 

pTyr levels in CB1 cells infected with adenovirus containing LHPP 
compared to cells infected with RFP-only control. Similar result 

was observed in two independent experiments. For full scans, see 
Supplementary Fig. 9. d, e, Immunoblotting (d) and quantification (e) 
show a reduction in 3-pHis levels in SNU449 cells infected with 
adenovirus containing LHPP compared to control-infected cells. Similar 
results were observed in four independent biological experiments. 
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For full scans, see Supplementary Fig. 8. A 50-60% confluent dish was first 
infected, 24h after infection, the cells were trypsinized and 2 x 10° cells 
were seeded into a 6-well adherent tissue culture dish and 24h later lysed 
in pHis compatible buffer (see Methods). The monoclonal antibodies used 
to detect 3-pHis and 1-pHis were SC44-1 and SC1-1, respectively (n= 4). 
** P= 0.0019, two-sided ratio paired f-test. Data are mean + s.d. f, Colony- 
forming assay in CB1 cells (n =3 biologically independent experiments) 
and SNU449 (n=5 biologically independent experiments) cells shows 
reduced proliferation upon LHPP over expression. Twelve hours after 
infection, CB1 (2 x 10* per well) and SNU449 (4 x 104 per well) cells 
were seeded into a 6-well plate tissue culture plate and stained with 

crystal violet (2% crystal violet in 20% methanol) on day 7. **P= 0.0026, 
**** P< (),0001, two-sided unpaired t-test. Data are mean + s.d. See 
Methods for further details. g, Photomicrographs of LHPP-overexpressing 
CB1 cells (left) displayed reduced ability to form hepatospheres in ultralow 
attachment plates; n = 3 biologically independent experiments. 

** P — 0.0024, two-sided unpaired t-test. Data are mean +s.d. 

(see Methods). 
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Extended Data Figure 5 | See next page for caption. 
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Extended Data Figure 5 | LHPP is a tumour suppressor. a, Left 

and middle, colony-forming assay in SNU449 cells shows increased 
proliferation upon LHPP knockdown using pooled short interfering RNA 
(siRNA). Then 36h after infection, SNU449 cells were re-transfected with 
LHPP siRNA and 4 x 10* cells per well were seeded into a 6-well tissue 
culture plate and allowed to proliferate for another 3 days. At the end of 
the third day, cells were stained with crystal violet (2% crystal violet in 
20% methanol), and similar results were obtained from five biologically 
independent experiments. **P = 0.0075, two-sided unpaired t-test. Data 
are mean + s.d. Right, immunoblot analysis (m= 1) revealed a reduction 
in LHPP protein levels after siRNA transfection. Longer exposure of 

the LHPP blot relative to the blot in Extended Data Fig. 4d. In addition 

to immunobloting, a reduction in LHPP transcript levels after siRNA 
transfection was also confirmed in two independent experiments (data 
not shown). For full scans, see Supplementary Fig. 10. b, Immunoblot 
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analysis (bottom) and quantification (top) of non-tumour and tumour 
liver lysates confirmed LHPP overexpression in 20-week-old AAV-LHPP- 
infected mice (as in Fig. 3e). For full scans, see Supplementary Fig. 11. 

c, Serum extracted from mice in (Fig. 3e) and analysed for ALT, AST and 
LDH levels shows a reduction in liver damage upon AAV-LHPP infection, 
unit per litre (U1~') (control mice infected with AAV-control and AAV- 
LHPP virus (n= 4), L-dKO mice infected with AAV-control (n = 5) and 
AAV-LHPP virus (n=8)). Data are mean + s.d. P values are by one-way 
ANOVA with Tukey’s multiple-comparison test. ALT (adjusted 

** P— 0.004, ****P < 0.0001), AST (adjusted **P = 0.013 for control 
mice versus L-dKO mice transfected with AAV-control; adjusted 

** P — 0.0075 for L-dKO mice transfected with AAV-control versus 
L-dKO mice transfected with AAV-LHPP), LDH (adjusted *P = 0.0196, 
**P—0,0037). 
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Extended Data Figure 6 | See next page for caption. 
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Extended Data Figure 6 | LHPP is a tumour suppressor in human HCC. 
a, Representative immunohistochemistry images from the LHPP-stained 
tissue microarray. Scale bar, 100m. b, Representative images and staining 
intensities of LHPP staining in the tissue microarray. c, Tissue microarray 
(TMA) indicates that LHPP is significantly downregulated in human 
HCC compared to adjacent non-tumour tissue (n = 20 HCC patients, 
** P < (0001, two-sided paired t-test). d, Box plots showing a 
significant reduction in LHPP expression in HCC tissue compared to 
adjacent non-tumour tissue (n= 59). ****P < 0.0001, two-sided Mann- 
Whitney U test. Non-tumour (minimum, — 1.422, median, 0.03329, 
maximum, 0.6179, lower 95% confidence interval (CI) of mean —0.0931, 
upper 95% CI of mean 0.0931) and HCC (minimum, —1.797, median, 
—0.3729, maximum, 0.7683, lower 95% CI of mean —0.5328, upper 95% 
CI of mean —0.2506). Expression of PHPT1 and PGAMS was not reduced 
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in HCC. e, Box plots showing a strong reduction in LHPP expression in 
Edmondson grade III/IV tumours (aggressive tumours, n = 17) compared 
to Edmondson grade I/II tumours (less aggressive tumours, n = 42). 

** P — 0.0036, two-sided Mann-Whitney U test. Edmondson grade I/II 
(minimum, —1.592, median, —0.2594, maximum 0.7683, lower 95% CI of 
mean — 0.3987, upper 95% CI of mean —0.0986) and III/IV (minimum, 
—1.797, median, —0.5281, maximum, —0.09671, lower 95% CI of mean 
—1.018, upper 95% CI of mean —0.4717). f, Lollipop plot showing the 
distribution of mutations in LHPP across pan-cancer datasets in the 
TCGA and ICGC portal. Solid white circles indicate missense mutations, 
red circles indicate inactivating mutations (including nonsense mutations, 
splice site mutations and frameshift deletions; detailed list of mutations is 
provided in Supplementary Table 5). 
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Optogenetic regulation of engineered cellular 
metabolism for microbial chemical production 


Evan M. Zhao!, Yanfei Zhang', Justin Mehl', Helen Park', Makoto A. Lalwani, Jared E. Toettcher? & José L. Avalos!? 


The optimization of engineered metabolic pathways requires 
careful control over the levels and timing of metabolic enzyme 
expression’ *. Optogenetic tools are ideal for achieving such precise 
control, as light can be applied and removed instantly without 
complex media changes. Here we show that light-controlled 
transcription can be used to enhance the biosynthesis of valuable 
products in engineered Saccharomyces cerevisiae. We introduce 
new optogenetic circuits to shift cells from a light-induced growth 
phase to a darkness-induced production phase, which allows us to 
control fermentation with only light. Furthermore, optogenetic 
control of engineered pathways enables a new mode of bioreactor 
operation using periodic light pulses to tune enzyme expression 
during the production phase of fermentation to increase yields. 
Using these advances, we control the mitochondrial isobutanol 
pathway to produce up to 8.49 + 0.31 g17' of isobutanol and 
2.38 + 0.06 g1—! of 2-methyl-1-butanol micro-aerobically from 
glucose. These results make a compelling case for the application 
of optogenetics to metabolic engineering for the production of 
valuable products. 

Metabolic engineering aims to rewire the metabolism of organisms 
for efficient conversion of inexpensive substrates into valuable products 
such as chemicals, fuels, or drugs'”. Fine-tuning the timing and 
levels of expression of enzymes involved in both natural and engineered 
pathways can relieve bottlenecks and minimize the metabolic burden 
of chemical production**. This is especially critical when the product 
of interest or its precursors are toxic, or when the biosynthetic pathway 
of interest competes with endogenous pathways that are essential for 
cell growth. 

To address these challenges, metabolic engineers frequently use 
inducible systems to control metabolic enzyme expression* (see 
Supplementary Discussion). This approach makes it possible to 
separate bioreactor operation into two phases: a growth phase, during 
which product biosynthesis is repressed, and a production phase, when 
flux through the engineered pathway is maximized. Essential pathways 
that compete with product formation can be controlled with ‘metabolic 
valves’ —genetic programs that express essential enzymes during the 
growth phase to build biomass, and repress them during the production 
phase to redirect metabolism towards desired products>”. 

Light is an attractive strategy to control gene expression in yeast for 
metabolic engineering applications. It is inexpensive and compatible 
with any carbon source or nutrient composition. Furthermore, light 
can be applied or removed instantaneously; this precise control over 
the level or duration of enzyme expression could simplify the screening 
of optimal proportions of metabolic pathway enzymes and enable 
new time-varying modes of control®'°. Light-switchable transcription 
modules have been shown to enable non-toxic, tunable gene expression 
in a variety of organisms’”’, including yeast'!!?"!°. We thus sought 
to test whether optogenetics could be used to control rewired cellular 
metabolisms to overproduce valuable chemicals. 


Here, we describe two powerful optogenetic gene expression 
systems for yeast, OptoEXP and OptoINVRT, based on the blue 
light-activated EL222 gene expression system’. Using these systems, we 
show that it is possible to activate and to repress distinct sets of genes ina 
light-dependent manner. We apply our approach to control endogenous 
and engineered metabolic pathways to define the growth and production 
phases of fermentation, enabling production of three valuable 
chemicals (lactate, isobutanol and 2-methyl-1-butanol (2-MBOH)), 
the biosyntheses of which directly compete with essential ethanol 
production. Using a time-varying illumination schedule, we achieve 
titres of 8.49 + 0.31 gl’ (mean +s.d.) of isobutanol and 2.38 + 0.06 g1"* 
of 2-MBOH. Our results thus reveal that a simple technology— 
bidirectional light-controlled enzyme expression—offers a rich set of 
tools for metabolic engineering. 

Our first goal was to construct bidirectional gene circuits in yeast to 
either induce or repress genes of interest with light. We used the EL222 
optogenetic transcription system, which consists of a light-sensitive 
transcription factor from Erythrobacter litoralis (EL222) and its corre- 
sponding C120 promoter (P12), to drive the expression of a gene of 
interest'®-'8 (Fig. 1a). The EL222 system is a robust and versatile gene 
expression platform, previously applied in Escherichia coli, mammalian 
cells and zebrafish!*!°°, We first constructed a yeast strain, YEZ139, 
in which expression of VP16-EL222 (a fusion of EL222 with the tran- 
scriptional activation domain of VP16 and a nuclear localization signal) 
was driven by the strong constitutive promoter of PGK1 (Pex) (Ppcxi- 
VP16-EL222) and expression of green fluorescent protein (GFP) was 
driven by the Pci29 promoter (Pci29-GFP; see Methods, Supplementary 
Tables 1, 2, Extended Data Fig. 1). We called this system (and variants 
with different promoters driving EL222) OptoEXP. 

OptoEXP enables strong and titratable light-inducible gene 
expression. In both glucose and glycerol media, cells with OptoEXP 
controlling GFP expression show a 43-fold increase in GFP expression 
when exposed to constant light compared to cells incubated in the dark, 
whereas intermittent light pulses produce intermediate expression 
levels (Fig. 1b, Extended Data Fig. 2). We found that all light sources 
used were sufficiently bright to activate the EL222 system maximally; 
thus, varying duty cycle was a reliable and reproducible method for 
achieving intermediate gene expression output (Extended Data Fig. 3, 
Methods). The maximum activation levels of OptoEXP are comparable 
to those reached by the ADH1 promoter (P4py7), a constitutive 
promoter commonly used in metabolic engineering (Fig. 1c). 

To construct a light-repressible gene circuit, we inverted the response 
of the OptoEXP system in a manner akin to the NOT logic gate used in 
digital processes. We harnessed the yeast galactose (GAL) regulon”’, 
in which Gal80 binds to and inhibits the Gal4 transcription factor, 
blocking its ability to induce expression from the GALI 
promoter (Pg,z7). We reasoned that engineering yeast cells with 
constitutive expression of GAL4 and GAL80 expression under the 
control of VP16-EL222 would lead to constitutive expression from 
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Figure 1 | Characterization of optogenetic circuits. a, Reversible 
OptoEXP system based on VP16-EL222 that is sensitive to 450 nm light. 
hv indicates the energy of photons. HTH, helix-turn-helix DNA-binding 
domain. b, GFP expression with OptoEXP (YEZ139) tuned with different 
duty cycles, for example 5-s/80-s pulse means that the light was on for 5s 
and off for 75s. AU, arbitrary units of fluorescence and optical density. 

c, Maximum GFP expression from OptoEXP compared to common 
constitutive promoters using strains YEZ27 (CYC1 promoter, Pcyc:); 
YEZ28 (P4pu1), YEZ29 (Ppgx1), YEZ30 (Prpu3), YEZ31 (Prep) and 
YEZ32 (Pci29). d, OptoINVRT circuit diagram (details in Extended Data 
Fig. 4a, b). e, OptoINVRT circuits in CENPK.2-1C-derived (GAL80A, 
GAL4A) strains YEZ100 (OptoINVRT1), YEZ101 (OptoINVRT2) and 
YEZ102 (OptoINVRT3). f, OptoINVRT circuits in BY4741-derived 
(GAL80A, PDC1A, PDCS5A, PDC6A) strains YEZ115 (OptoINVRT1), 
YEZ116 (OptoINVRT2) and YEZ117 (OptoINVRT3). All data are shown 
as mean values; dots represent individual data points; error bars represent 
the s.d. of four biologically independent 1-ml sample replicates exposed to 
the same conditions. All experiments were repeated at least three times. 


the Pgar promoter in the dark and repression of Pgaz, in the light 
(Fig. 1d, Extended Data Fig. 4a, b). 

Starting from a strain in which both GAL80 and GAL4 are deleted, 
YEZ44, we constructed three variants of this core inverter topology, 
which we termed OptoINVRT1, OptoINVRT2 and OptoINVRT3 
(Supplementary Tables 1, 2). These variants differed in the strength 
of the promoter driving GAL4 and the fusion of a photosensitive 
degron (PSD) domain”? to the C terminus of Gal4 to induce faster 
and more complete light-dependent repression (Extended Data 
Fig. 4a, b and Supplementary Table 3). Using Pg4y;-GFP as a reporter, 
we found that all three OptoINVRT circuits exhibit robust light- 
induced gene repression (Fig. le, Supplementary Table 4). 
OptoINVRT2 has the highest maximum expression in the dark 
(in YEZ101, almost 85% of TEF1 promoter (Prgf;) levels), while 
OptoINVRT3 has the highest levels of repression (in YEZ102, more 
than 70-fold) and lowest expression in full light. All three OptoINVRT 
circuits show similar responses in a second yeast strain, Y202, which 
is relevant for metabolic engineering owing to the deletion of all 
three of its pyruvate decarboxylase genes (strain S288C with the 


684 | NATURE | VOL 555 | 29 MARCH 2018 


Strain: OptoEXP-PDC7 + 
PDC1A, PDC5A, PDC6A, 


rs 
@2 


1 


No ethanol production Ethanol production 
No growth on glucose Growth on glucose 


de 
gs 0.4 
Es} 
2S 
£5 
oO 
2202 
‘ 5 8 
0 5 10 15 20 25 38 
Time after inoculation (h) an 
G 0 
* Wild-type + -20s/80s 5 0 05 10 
constant light .10s/80s Duty cycle per 80-s interval 
«Constant light =» Dark 


-40s/80s 
Figure 2 | A light-dependent metabolic valve for ethanol production. 
a, Using OptoEXP to build light-inducible metabolic valve for PDC1. 
b, Light-dependent growth on glucose of a triple PDC mutant strain 
containing the OptoEXP-PDC1 metabolic valve. Top left, wild-type 
BY4741; top right, YEZ61-23 (PDC1A, PDC5A, PDC6A, Prerj-VP16- 
EL222, Pcj29-PDC1); bottom left, YEZ50C (PDC1A, PDC5A, PDC6A, 
Preri-VP16-EL222); bottom right, YEZ61C (PDC1A, PDC5A, PDC6A, 
Pci20-PDC1). ¢, YEZ61-23 and BY4741 growth in liquid synthetic 
complete medium supplemented with 2% glucose at different light doses, 
with best fits to exponential growth curves. d, Maximum growth rate of 
YEZ61-23 as a function of light. Solid line represents the mean wild-type 
(BY4741) growth rate (dotted lines indicate the s.d.). All data are shown as 
mean values (values for individual points are available in the Source Data 
for this figure); error bars represent the s.d. of three biological replicates 
exposed to the same conditions. All experiments were repeated at least 
three times. 


following mutations: GAL80A, PDC1A, PDC5A and PDC6A) 
(Fig. 1f, Supplementary Table 4). Our results thus demonstrate that the 
OptoINVRT platform can achieve a wide range of expression levels, 
light sensitivities and fold-change responses in different strain back- 
grounds for flexible incorporation in diverse metabolic engineering 
applications. 

To reduce the ethanol by-product in pyruvate-derived chemical 
biosyntheses, while still allowing cell growth on glucose, we used 
OptoEXP to control pyruvate decarboxylation (PDC), an essential 
step in ethanol biosynthesis. Completely removing PDC activity 
in S. cerevisiae (by deleting PDC1, PDC5 and PDC6) does not 
suffice, because the triple deletion renders yeast unable to grow 
on glucose owing to the essential role of these genes in NAD* 
recycling for glycolysis’ and to the lack of effective alternatives for 
ATP generation owing to glucose-mediated repression of respiration”. 
Thus, we used our OptoEXP circuit to build a light-dependent 
metabolic valve for PDC] expression (Fig. 2a). Light stimulation ‘opens’ 
the valve, enabling robust cell growth and ethanol production; dark 
incubation ‘closes’ the valve to limit the metabolism of pyruvate into 
ethanol, thereby inhibiting cell growth on glucose, and enhancing the 
biosynthesis of alternative pyruvate-derived products. 

A strain with all PDC genes deleted and with OptoEXP driving 
PDC1 expression, YEZ61-23 (Supplementary Table 2), exhibits light- 
dependent growth on glucose. YEZ61-23 grows on glycerol with 
ethanol (yeast peptone with 3% glycerol and 2% ethanol (YPGE)) 
independently of light because PDC is dispensable for respiration. 
However, it is only able to grow on glucose plates (yeast peptone 
with 2% glucose (YPD)) in the presence of blue light (Fig. 2b), with 
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Figure 3 | Light-controlled isobutanol production. a, Ethanol and 
isobutanol biosynthesis controlled by OptoINVRT-ILV2 and OptoEXP- 
PDC1, respectively. b, Blue light allows growth of YEZ167-4 in glucose, as 
well as ethanol formation, whereas dark conditions initiate the production 
phase by stopping PDC1 induction and de-repressing ILV2. c, Isobutanol 
and 2-MBOH production of YEZ167-4 in fermentations of 80h in 150g]! 
glucose and blue light pulses of 15s on and 65s off for 30 min every 5, 

10, or 20h. YZy335 is a light-insensitive isobutanol-producing control 
strain. d, Growth of YEZ167-4 in a 2-] bioreactor and under constant blue 
light, compared to the YZy335 light-insensitive control. e, Isobutanol and 


its growth rate in liquid medium reaching 91.9 + 4.9% of the wild 
type (BY4741) at full light (Fig. 2c, d). Substantial growth is achieved 
even at low light doses (43.7 + 2.7% of wild-type growth rate with 
only 12.5% light duty cycle), suggesting that light-dependent growth 
may be achieved at cell densities relevant for metabolic engineering 
applications. 

We then tested whether the bidirectional control afforded by com- 
bining our OptoEXP and OptoINVRT circuits could be used to drive 
two phases of cellular metabolism: a growth phase with mainly ethanol 
fermentation and a production phase in which carbon accumulates 
as a desired valuable product instead (either lactate or isobutanol). 
Production of lactate or isobutanol competes directly with ethanol 
production as distinct enzymes modify pyruvate to produce each 
product (pyruvate decarboxylase 1 (Pdc1) in the case of ethanol, 
lactate dehydrogenase (Ldh) in the case of lactate, and acetolactate 
synthase (Ilv2) and subsequent enzymes in the case of isobutanol) 
(Fig. 3a, Extended Data Fig. 5a). We reasoned that by controlling 
the expression of PDC1 with OptoEXP and the expression of Ldh or 
ILV2 with OptoINVRT, moving cells from light to dark would shift 


2-MBOH production of YEZ167-4 in a 0.5-1 fed-batch pH-controlled 
fermenter using periodic light stimulation during the production phase. 
Total fusel alcohols represent the sum of isobutanol and 2-MBOH. Blue 
bar indicates the time of blue light exposure. Dotted lines indicate the 
start and stop of the glucose feed. All data are shown as mean values; dots 
in c represent individual data points; individual points for d and e are 
available in the Source Data for this figure and Supplementary Table 5, 
respectively; error bars represent the s.d. of three biological replicates (c) 
or three separate bioreactor runs (d, e). All experiments were repeated at 
least twice. 


the metabolism from the production of ethanol to the production of 
lactate or isobutanol (Fig. 3b, Extended Data Fig. 5b). 

We first tested our strategy in the simpler case of lactate biosynthesis, 
which requires overexpression of only one enzyme (Ldh). The 
presence of lactate is also is well tolerated by yeast. We integrated 
multiple copies of Pgar;-Ldh and P¢j29-PDC1 into our Y202- 
derived OptoINVRT strains (Extended Data Fig. 6a) and observed a 
pronounced increase in lactate biosynthesis upon a shift from dark 
to light, consistent with light-dependent changes in PDC1 and Ldh 
expression levels (Methods, Extended Data Figs 4c, 5). On the basis of 
these results, we next tested our optogenetic circuits on the more com- 
plex biosynthesis of isobutanol (Fig. 3a, b), an advanced biofuel with 
much higher toxicity than lactate. Our approach was to optogenetically 
control only the first enzyme in the isobutanol pathway (acetolactate 
synthase), reasoning that if subsequent enzymes were constitutively 
expressed, the overall pathway flux would still be light-controlled. 

We transformed Y202 with OptoEXP, each one of the OptoINVRT 
circuits, multiple copies of Pg4r;-ILV2 and Pcj29-PDC1 and the rest 
of the mitochondrial isobutanol biosynthetic pathway under strong 
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constitutive promoters” , to produce strains YEZ159, YEZ156 and HPY6 
(Supplementary Tables 1, 2, Methods). We found that colonies from 
YEZ159, containing OptoINVRT1, produced the highest isobutanol 
titres from 4% glucose (Extended Data Fig. 4d, e). To enhance isobu- 
tanol production further, we deleted the mitochondrial branched chain 
amino acid aminotransferase (encoded by BAT1, also known as YBT1), 
which competes for the «-ketoisovalerate precursor””’, resulting 
in strain YEZ167-4 (Supplementary Table 2). YEZ167-4 contains six 
copies of Pcy29-PDC1 (Extended Data Fig. 6a), leading to fast cell 
growth under full light. 

By varying the cell density at which cultures are switched from 
dark to light and the incubation time in the dark before starting the 
fermentation (Extended Data Fig. 5c), we found that YEZ167-4 can 
produce as much as 735 + 15mg] of isobutanol from 2% glucose over 
48h (a yield of 34.2 + 0.7 mg isobutanol g! glucose, Extended Data 
Fig. 7). Switching to 15% glucose and fermentation periods of 80h 
only increases production to 1.22 +0.11 gl7’ of isobutanol (Fig. 3c). 
However, under these conditions YEZ167-4 is unable to consume all 
the glucose in the medium (Extended Data Fig. 8a), indicating a stalled 
fermentation in which cells have undergone premature metabolic arrest. 

We hypothesized that during prolonged dark incubation, Pdcl 
becomes limiting to a point at which cellular metabolism arrests 
owing to NAD* depletion. We reasoned that periodic pulses of light 
during the production phase of fermentation could transiently induce 
PDC1 expression, thus increasing NAD* pools and restoring cellular 
metabolism, glucose consumption and isobutanol production. We 
tested different light schedules, applying periodic illumination bouts 
(30 min of blue light at a duty cycle of 15s on and 65s off once every 
5, 10 or 20h throughout the production phase of the fermentation, 
which lasted 80h). When cells are exposed to one light bout every 10h, 
isobutanol production nearly triples to 3.37 £0.17 g1~! (Fig. 3c), and 
2-MBOH, another desirable advanced biofuel co-produced with the 
mitochondrial isobutanol pathway’’, is produced at 433 +69mgl~’. 
These isobutanol yields are more than 2.5 times higher than if cells 
are kept in the dark throughout the production phase of the fermen- 
tation, and four times higher than YZy335, a strain that contains the 
entire isobutanol pathway and BAT! deletion but lacks light-inducible 
metabolic control. We directly confirmed that our light pulses partially 
replenish the NADt metabolite pool under these fermentation condi- 
tions (Methods, Extended Data Fig. 8b). 

To determine the applicability of optogenetics to metabolic engi- 
neering in laboratory-scale bioreactors, we measured the growth of 
the OptoEXP-driven PDC] strain, YEZ167-4, in a 2-l fermenter with 
15% glucose and constant blue light (Extended Data Fig. 9a, b). Under 
these conditions, YEZ167-4 is able to reach the same optical density at 
600 nm (OD¢00 = 17.1) as a light-independent, PDC* strain, YZy335 
(Supplementary Table 2), indicating that light penetration into the 
bioreactor is sufficient to drive robust PDC1 expression and growth 
(Fig. 3d). We then measured the production of isobutanol and 
2-MBOH in fed-batch 0.5-1 bioreactors under microaerobic conditions 
and pH control (Extended Data Fig. 9c, d). We used constant illumi- 
nation to build YEZ167-4 biomass during a batch growth phase, and 
then switched to a fed-batch production phase with periodic bouts of 
light to maintain sufficient NAD* levels to preserve metabolic activity. 
This fermentation strategy yielded 8.49 + 0.31 gl! of isobutanol and 
2.38 + 0.06 g1~! of 2-MBOH, with post-induction average yields of 
53.5+8.4mg¢ glucose for isobutanol (13% of theoretical maximum) 
and 14.17 +2.57 mgg~' glucose of 2-MBOH, with measured instanta- 
neous isobutanol yields of up to 270.6mgg! glucose (66% of theoret- 
ical yield) (Fig. 3e, Supplementary Table 5, Methods, Supplementary 
Results). Our mechanistic predictions of a darkness-induced metabolic 
shift to isobutanol biosynthesis are supported by quantitative PCR 
(qPCR) measurements of PDC1 and ILV2 mRNA levels (Extended 
Data Fig. 6b). 

Optogenetics is a powerful way to control cellular physiology. 
Photosensitive proteins have had an enormous impact in neuroscience, 
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cell biology and developmental biology by controlling ion channels, 
enzyme activity and gene expression®”* °°. Our work demonstrates that 
optogenetics also holds promise in metabolic engineering, enabling the 
reversible control and fine-tuning of engineered metabolic pathways. 

Nevertheless, combining optogenetics and metabolic engineering has 
its challenges. The high cell densities usually associated with microbial 
fermentations might be predicted to severely limit light penetration, 
which is one reason we initially adopted our design of a darkness- 
induced production phase. However, additional experiments showed 
robust light-stimulated gene expression could be achieved even at 
cell densities as high as OD¢09 = 50 in 5-1 bioreactors (Extended Data 
Fig. 10), suggesting that the high light sensitivity and strong levels of 
gene expression achieved by the current generation of optogenetic 
tools may be sufficient even in these challenging conditions. Even 
more potent responses to dim light could potentially be achieved by 
using well-characterized light-oxygen-voltage-sensing-domain (LOV) 
mutants with longer lit-state lifetimes!* (our current VP16-EL222 
remains in the photoactivated state for approximately 30s; ref. 12), or 
other photoactive proteins with highly stable light-switchable confor- 
mations’? (see Supplementary Discussion). For fermentation condi- 
tions that pose a greater challenge to light penetration than in this study, 
our OptoINVRT circuits offer a solution to control gene expression as 
they do not require light during the fermentation. 

We used optogenetic regulation of engineered metabolic pathways 
to address the long-standing challenge of ethanol competition in 
branched-chain alcohol production. We and others have developed 
triple PDC deletion strains (PDC1A, PDC5A, PDC6A) that recovered 
their ability to grow on glucose by directed evolution”’. However, efforts 
to produce isobutanol in these strains from mitochondrial (unpub- 
lished results) or cytosolic! biosynthetic pathways have thus far been 
unsuccessful. Our light-controlled metabolic valve offers an effective 
alternative to genetically deleting essential pathways that compete with 
a pathway of interest®. This allowed us to surpass the highest titres 
of isobutanol and 2-MBOH reported for yeast in the peer-reviewed 
literature that we have analysed by fivefold (Supplementary Table 6) 
and 20-fold”», respectively, making a strong case for the application of 
optogenetics to metabolic engineering. 

Optogenetic regulation of metabolic pathways enables new strate- 
gies for optimizing engineered pathways and fermentation conditions 
using periodic light pulses. Although further development for indus- 
trial applications is needed, in the future, light inputs delivered to a 
bioreactor could be automatically controlled in response to feedback 
from different fermentation outputs (for example, optical density or 
the readout of biosensors), providing unprecedented capabilities for 
operating, optimizing and automating fermentations for valuable 
product biosynthesis. 


Online Content Methods, along with any additional Extended Data display items and 
Source Data, are available in the online version of the paper; references unique to 
these sections appear only in the online paper. 
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METHODS 


Assembly of DNA constructs. We cloned promoter-gene-terminator sequences 
into previously described standardized vector series (pJLA vectors)”°. This allows 
for easy manipulation and generation of multi-gene plasmids. All genes were 
designed to have Nhel and Xhol restriction sites at the 5’ and 3’ ends, respectively, 
which were used to insert the genes into pJLA vectors. Each promoter-gene- 
terminator construct is flanked by Xmal and Agel restriction sites at their 5’ ends, 
and Mrel, Ascl and BspEI sites at their 3’ ends, which we used for easy assembly of 
multi-gene plasmids, as previously described (Supplementary Table 1). 

Qiagen Miniprep, Qiagen Gel Extraction and Qiagen PCR purification kits 
were used to extract and purify plasmids and DNA fragments. Most genes and 
promoters (ILV2, ILV3, ILV5, ARO10, LIAdhA®#! (ref. 32), GAL4, GAL80, GFP, 
Pap Press Pros, Peek Pcycn Papi) were amplified from yeast genomic DNA 
or laboratory plasmids, using Phusion polymerase from NEB, according to the 
manufacturer's instructions. Other genes were amplified from plasmids provided 
by other groups: PsLdh from plasmid pET28a, Ldh**** from J. J. Lee; and the 
photosensitive degron derived from the fusion of the LOV2-V19L domain of 
phototropin 1 from Arabidopsis thaliana and a synthetic degradation sequence 
derived from the murine (mouse) ornithine decarboxylase from plasmid pDS$143 
from C. Taxis”. The codon-optimized sequence for VP16-EL222 was purchased 
as a gBlock from IDT. The sequence for the Pci29 promoter was synthesized by 
the Bio Basic gene synthesis service. When pJLA vectors were not available, we 
used Gibson isothermal assembly to produce our constructs, based on published 
protocols**. Enzymes were purchased from NEB (Xmal, Ascl, Nhel, Xhol, BspEI, 
Agel, T4 DNA ligase, Phusion Polymerase) and Thermo Fisher Scientific (Mrel). 

We modified the single copy integration plasmid pNH603* to make a plasmid 
compatible with the pJLA vectors that can be used to introduce gene cassettes 
into the HIS3 locus. We first removed the AscI site of pNH603 and replaced 
the ADH] terminator (T4px1) sequence between PtsI and SacI with a fragment 
containing an Xmal restriction site. Then we introduced a cloning sequence array 
consisting of Xmal, Mrel, AscI and BspEI between the Xmal site (which replaced 
the T4pn1) and KpnI restrictions sites, to make pYZ12-B (Extended Data Fig. la, 
Supplementary Table 1). This addition makes pYZ12-B compatible with the pJLA 
platform of vectors, and allowed for easy transfer of gene cassettes from pJLA 
2\ plasmids. Gene constructs in pYZ12-B were integrated into the HIS3 locus 
of the genome by linearizing the plasmid with Pmel. We also used pRSII416* to 
introduce genes in a single copy episomal plasmid (CEN); this was done by insert- 
ing promoter-gene-terminator constructs cut from pJLA vectors with Xmal and 
Mrel and inserting at the Xmal site of pRSII416. 

Similar to pYZ12-B, we developed pYZ23 (Supplementary Table 1, Extended 
Data Fig. 1b), a pJLA vector-compatible plasmid, to integrate multiple copies of 
gene constructs into the -sites of the yeast genome. The pYZ23 plasmid targets 
the YARCdelta5, the 337-bp-long terminal repeat (LTR) of S. cerevisiae Ty] retro- 
transposons (YARCTy1-1, SGD ID: $000006792)**. We constructed pYZ23 with 
four overlapping DNA fragments using the Gibson isothermal assembly method”. 
The four fragments are: (1) the Pmel-linearized backbone fragment from pYZ12-B 
containing the ampicillin-resistance gene; (2) the first 207 bp of the YARCdelta5 
LTR; and (3) the last 218 bp of the YARCdelta5 LTR, both of which were ampli- 
fied from the BY4741 genome using primer pairs Yfz_Oli39 and Yfz_Oli40, and 
Yfz_Oli43 and Yfz_Oli44, respectively (Supplementary Table 7); and (4) the 
BleMX6 gene cassette from pCY 3090-07 (Addgene 36232), amplified using 
primers Yfz_Oli41 and Yfz_Oli42 (Supplementary Table 7), which add flanking loxP 
sites (lox66 and lox71) to the BleMX6 gene. Additional restriction sites, including 
Mrel, AscI and BspEL, were introduced for subcloning (Extended Data Fig. 1b, 
Supplementary Table 7). 

All vectors were sequenced using Sanger sequencing from GENEWIZ before 
using them to transform yeast. We avoid using tandem repeats to prevent recombi- 
nation after yeast transformation (Extended Data Fig. 1c), and thus do not observe 
instability of plasmids. 

Yeast transformations. Yeast transformations were carried out using standard 
lithium acetate protocols, and the resulting strains are catalogued in Supplementary 
Table 2. Gene constructs derived from pYZ12-B and pYZ23 vectors were genomi- 
cally integrated into the HIS3 locus and 6-sites (YARCdelta5), respectively. These 
vectors were first linearized with Pmel and DNA fragments were purified using 
a Qiagen PCR purification kit before using them for yeast transformation. Gene 
deletions were carried out using homologous recombination strategies. DNA 
fragments containing antibiotic resistance cassettes flanked with loxP sites were 
amplified using PCR from pAG26* (containing the hygromycin resistance gene, 
hygromycin B phosphotransferase (HygB-PT)), pUG6” (containing the G418 
resistance gene KanMX), or pAG36*° (containing the nourseothricin resistance 
gene NAT1), using primers with 40 bp of homology to the promoter and terminator 
regions of the gene targeted for deletion. Antibiotic-resistance markers were 
subsequently removed by expressing Cre recombinase from the pSH62 (AF298785) 


vector’). After transformation, cells were plated on synthetic complete (SC) drop 
out medium depending on the auxotrophy restored by the construct. In the case 
of antibiotic selection, cells were plated onto non-selective YPD plates for 16h, 
then replica plated onto YPD plates with 300j1g ml“! hygromycin (purchased from 
Invitrogen), 200j:g ml“! nourseothricin (purchased from WERNER BioAgents), or 
200 1g ml“! G418, purchased from Gibco by Life Technologies). Zeocin was used 
to select for §-integration at concentrations ranging from 800 to 1,200pg ml! 
(purchased from Thermo Fisher Scientific). 

All strains with genomic integrations or gene deletions were genotyped with 

PCR to confirm their accuracy. We integrated constructs in the HIS3 locus or 
6-sites to promote strain stability. 
Yeast cell culture growth, centrifugation and optical measurements. Unless 
otherwise specified, liquid yeast cultures were grown in 24-well plates, at 30°C and 
shaken at 200r.p.m., in either YPD or SC-dropout medium supplemented with 2% 
glucose. SC-dropout media include 94.8 mg] of valine, 94.8 mg]! of isoleucine 
and 189.6 mg]! of leucine along with all other necessary standard nutrients unless 
otherwise specified. To stimulate cells with light, we used blue light-emitting 
diode (LED) panels (HQRP New Square 12-inch Grow Light Blue LED 14W), 
placed 40 cm from cell cultures. To control light duty cycles, the LED panels were 
regulated with a Nearpow Multifunctional Infinite Loop Programmable Plug-in 
Digital Timer Switch (purchased from Amazon). To resuspend cells in new media, 
cell cultures were centrifuged in a table-top centrifuge, with 24-well plate rotor 
adaptors. Unless otherwise specified, plates were centrifuged at 234g for 5 min. 

Fluorescence and OD¢o9 measurements were taken using a TECAN plate 
reader (infinite M200PRO). The excitation and emission wavelengths used for 
GFP fluorescence measurements were 485 nm and 535nm, respectively, using an 
optimal gain for all measurements. To process fluorescence data, the background 
fluorescence from the medium was first subtracted from values. Then, the GFP/ 
OD6g00 values of cells lacking a GFP construct were subtracted from the fluores- 
cence values (GFP/ODg0o) of each sample to normalize for light bleaching of the 
medium and cell contents. Thus, reported values were calculated according to the 
following formula 


__ GFP strain,condition — GFP media,condition 


GFP/OD.,_. tg 
/ strain, condition ~~ QD strain,condition — OD media,condition 


GFP no GEP control strain,condition — GFP media,condition 


OD no GEP control strain,condition ~— OD media,condition 


All fluorescence measurements were done at the end of experiments or on 
samples taken from experimental cultures, such that potential activation of VP16- 
EL222 by the light used to excite GFP did not affect our experiments or results. 

To measure cell concentration, optical-density measurements were taken at 
600 nm, using medium (exposed to the same conditions as the yeast) as a blank. 
Measurements were performed using the TECAN plate reader (infinite M200PRO) 
or Eppendorf spectrophotometer (BioSpectrometer basic), from samples diluted 
to a range of ODgop of 0.1 to 1.0. 

All experiments involving light-inducible strains were done under minimal 
ambient light, unless otherwise specified, to avoid unwanted activation of 
optogenetic systems. 

Construction of OptoEXP system. We purchased a gBlock (IDT) containing 
the yeast codon-optimized sequence of VP16-EL222, flanked by Nhel and 
Xhol restriction sites, which we inserted into plasmid pJLA121%"! (containing 
Ppex; and Tcyc;). We then used Xmal and AsclI to subclone Ppgx;-VP16- 
EL222-Tcyc) into pYZ12-B to make EZ-L105 (Supplementary Table 1), which 
allows single genomic integration of gene cassettes into the HIS3 locus. We then 
changed the promoter to Pygp; using Xmal and Nhel restriction site cutting and 
subsequent ligation to make EZ-158, used for OptoEXP expression in glycerol 
media. In addition, we synthesized the C120 and minimal promoter sequence 
(TAGAGGGTATATAATGGAAGCTCGACTTCCAG), otherwise known as 
Pcizo using Bio Basic’s gene synthesis service and used it to develop new pJLA 
vectors with the P¢j29 promoter and either an ADH1 or ACT] terminator, making 
pJLA121 or pJLA121°, respectively (Supplementary Table 1). 
Characterization of OptoEXP. To test the OptoEXP system, we built plasmid 
EZ-L83 (pJLA111-GFP®), which places GFP under Pc jz transcriptional control 
in a CEN/ARS plasmid with a URA3 marker (Supplementary Table 1). We then 
used EZ-L105 to integrate a single copy of Ppgx;-VP16-EL222-Tcyc) construct 
into the HIS3 locus of CENPK.2-1C, selecting strain YEZ24 from a SC lacking 
His (SC-His) plate supplemented with 2% glucose. Subsequently, we transformed 
YEZ24 with EZ-L83, and selected strain YEZ32 from a SC plate lacking uracil 
(SC-Ura) plate supplemented with 2% glucose. We also transformed YEZ24 with 
empty pRSII416 to make control strain YEZ32C, which has no GFP. 

To characterize light induction by OptoEXP, we tested four different colonies 
of YEZ32. We grew cells from each colony in liquid SC-Ura overnight in the dark 
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(tin foiled). The next morning, we diluted the cells to OD¢00 of 0.1 in fresh SC-Ura, 
to a final volume of 1 ml each, incubated in individual wells of a 24-well clear 
Costar plate. We prepared five identical plates and tin foiled one of them. The 
plates were then placed in the dark (tin-foil-covered plate), under constant blue 
light, or under blue light with duty cycles of 5s on and 75s off, 8s on and 72s off 
and 11s on and 69s off, with a blue LED panel at 40cm from each plate. We used 
duty cycles instead of light intensity to better control light dose and reproducibility 
(Extended Data Fig. 2). Cell cultures were grown for 8h under blue light panels 
or in the dark, then their GFP fluorescence and OD¢oo were measured using the 
TECAN plate reader. Error bars represent the s.d. from biological replicates (n= 4). 

To benchmark the combination of Ppgx;-VP16-EL222-Tcyc; and a single copy 
of Pciz0, which we refer to as OptoEXP, we compared it to several constitutive 
promoters. To achieve this, we made pJLA111-GFP°*™ constructs (CEN/ARS 
with URA3 markers) containing Pcyc; (EZ-L64), Papi (EZ-L63), Ppgxi (EZ-L67), 
Prpx3 (EZ-L65) and Prep; (EZ-L66), which we used to transform YEZ24 to make 
yeast stains YEZ27 (EZ-L64), YEZ28 (EZ-L63), YEZ29 (EZ-L67), YEZ30 (EZ-L65) 
and YEZ31 (EZ-L66). 

YEZ32C was used as a control with no GFP production for strains with 

pJLA111-GFP°*™ plasmids. YEZ27, YEZ28, YEZ29, YEZ30, YEZ31 and YEZ32 
were tested similarly to the OptoEXP testing above (YEZ32). 
Flow cytometry experiments. We used flow cytometry to determine whether the 
OptoEXP system produces a homogeneous response in the cell population. To 
construct strains for these experiments, we transformed CEN.PK2-1C with Pmel- 
linearized pYZ12-B (a non-fluorescent control), EZ-L136, EZ-L516 or EZ-L350 
to make YEZ140 (CEN.PK2-1C with histidine prototrophy restored), YEZ139 
(CEN.PK2-1C with OptoEXP driving GFP, and Ppgx; driving VP16-EL222, which 
works best in glucose), YEZ243 (CEN.PK2-1C with OptoEXP driving GFP, and 
Prep; driving VP16-EL222, which works well in glucose or glycerol) or YEZ186 
(CEN.PK-2C with HIS3::P r¢p;-GFP-T cr, for constitutive expression of GFP as 
control), respectively. 

To test the homogeneity of gene expression of OptoEXP we grew overnight 
cultures of YEZ139, YEZ140 and YEZ186 in SC-His medium supplemented with 2% 
glucose in the dark. The next morning, we diluted 2011 of these cultures into 98011 
of fresh medium in two 24-well plates. Both plates were placed in the dark and 
shaken at 200 r.p.m. at 30°C for 3h. Then, one plate was placed 40 cm below a 
blue light panel and the other was tin foiled and kept in the dark. Both plates were 
shaken at 2001.p.m. at 30°C for 3h. Then, 5 1l of culture was diluted into 995 jul 
of PBS and used for flow cytometry (Extended Data Fig. 2a, b). Samples were 
run in triplicates from three different cultures separated after the overnight stage. 
Representative samples from these triplicates were chosen for the figures. 

To test OptoEXP performance in non-fermentative conditions, we grew 10-ml 
overnight cultures of YEZ243, YEZ140 and YEZ186 in SC-His medium supple- 
mented with 3% glycerol and 2% ethanol, in tin-foiled tubes (in the dark) in the 
roller drum. The next morning, we diluted 30,11 of these cultures into 98011 of 
fresh medium to an ODg09 of 0.05-0.1 in four 24-well plates. All four plates were 
placed in the dark (tin foiled) and shaken at 2001r.p.m. at 30°C for 6h (until the 
yeasts were in the exponential growth phase, OD¢oo of approximately 3). Then, one 
plate was placed 40 cm below a blue light panel and the others were kept tin foiled 
(in the dark). After 1h, another plate was moved into the light. After another half 
hour more, another plate was moved into the light. All plates were kept shaken at 
200 r.p.m. at 30°C for 2h after the initial 6h of growth. Then, 511 of each culture 
was diluted into 995 11 of PBS medium and used for flow cytometry (Extended 
Data Fig. 2c). Samples were run in triplicates from three different cultures 
separated after the overnight stage. Representative samples from these triplicates 
were chosen for the figures. 

GFP expression was quantified by flow cytometry using a BD LSR I] flow 
cytometer (BD Biosciences) with an excitation wavelength of 488 nm and an 
emission wavelength of 530nm. The gating used in our analyses was defined to 
include positive (YEZ186) and negative (YEZ140) control cells based on GFP 
fluorescence, but exclude particles that are either too small or too large to be single 
living yeast cells, based on the side scatter (SSC-A) versus forward scatter (FSC-A) 
plots (Extended Data Fig. 2d). Mean fluorescence values were determined from 
20,000 cells. Data were analysed with the FlowJo v.10 software (Tree Star). 

All fluorescence measurements were performed at the end of experiments or on 
samples taken from experimental cultures, such that potential activation of VP16- 
EL222 by the light used to excite GFP did not affect our experiments or results. 
Construction of the OptoINVRT circuits. OptoINVRT circuits were 
initially developed and characterized in two yeast strain backgrounds: YEZ44 
(CENPK.2-1C, GAL80A, GAL4A) and Y202 (S288C, PDC1A, PDC5A, PDC6A, 
GAL80A, containing pJLA121-PDC1””), (Supplementary Table 2). 

Gene circuits were assembled using restriction enzyme digests and ligations 
afforded by pJLA vectors, in which open reading frames were inserted using Nhel 
and Xhol sites, and multiple cassettes assembled using Xmal (or Agel), Mrel 
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(or BspEI) and AscI**. Each gene circuit was constructed by assembling five 
promoter-gene-terminator sequences into single integration vectors targeting 
the HIS3 locus (EZ-L259, EZ-L260 and EZ-L266; Supplementary Tables 1, 3). 
The PSD used in OptoINVRT3 is derived from the fusion of phototropinl LOV2 
domain (V19L mutant) from A. thaliana and a synthetic degradation sequence 
derived from the murine ornithine decarboxylase”. This PSD was fused to the C 
terminus of GAL4 using Gibson assembly of pJLA121°°°? cut with Xhol and PSD 
amplified from pDS143” with overhang sequences of 30 bp that are homologous 
to the multiple cloning sequence and T api of pJLA121™" to make EZ-L247, to 
which we then inserted GAL4 using Nhel and Xhol through ligation to make the 
precursor plasmid of EZ-L266 (EZ-L251). 

Characterization of OptoINVRT circuits. The three OptoINVRT circuits, 
controlling GFP expression, were initially characterized by transforming 
YEZ44 to produce YEZ100 (OptoINVRT1), YEZ101 (OptoINVRT2) and 
YEZ102 (OptoINVRT3), and Y202 to produce YEZ115 (OptoINVRT1), 
YEZ116 (OptoINVRT2) and YEZ117 (OptoINVRT3) (Supplementary Table 2). 
OptoINVRT circuits were characterized using strains YEZ100-102 and YEZ115- 
117 by performing the same experiment done on YEZ32, described above, to 
characterize the OptoEXP system. In this case, we exposed cells to full light, 
complete darkness, or light pulses of 8s on and 72s off. YEZ186 and YEZ140 were 
used as controls for strains derived from YEZ44, and YEZ171 and YEZ94 were 
used as controls for strains derived from Y202. 

Characterization of light panels. The light intensity of panels was measured 
using a Quantum meter with a separate sensor (Model MQ-510 from Apogee 
Instruments), placed 40 cm from the light panel of interest (the same distance was 
used for all 24-well plate experiments). The resulting intensities ranged from 53 
to 134jzmolm ~?s~! with an average of 77.5;1molm *s~! (with 465 nm max peak 
spectra). Ambient light under these conditions ranged from 4-5 1molm~*s~!. We 
then measured the levels of GFP expression in strains YEZ243 and YEZ186 (as well 
as YEZ140, as a negative control) induced by three different panels with 53, 82 or 
134jzmolm~?s~? intensities, placed 40 cm from the 24-well plates, using the same 
protocol for experiments shown in Fig. 1b. Overnight cultures grown in the dark 
where diluted to an ODgoo of 0.1, and incubated for 8h in either full light, or light 
pulses of 10s on and 70s off. The levels of GFP expression were indistinguishable 
across the three panels tested (including the brightest and dimmest panels used in 
this study), and responded uniformly to changes in light duty cycle (Extended Data 
Fig. 3). This demonstrates that there is no detectable difference between variations 
in the intensities of panels used in this study and that varying duty cycle is an 
effective way to control gene expression using our optogenetic transcriptional 
controls. 

Development of an OptoEXP-PDC strain. Strain Y200 contains a triple gene 
deletion of PDCIA, PDC5A and PDC6A, as well as a 2\.- URA3 plasmid pJLA121- 
PDC1? with Prg21-PDC1-Tacr1, which allows it to grow robustly in glucose. 
Y200 was transformed with a Pmel-linearized EZ-L165 plasmid to insert a 
cassette composed of Prep;-VP16-EL222 -Tcyc; and P¢j29-PDC1-T apm into its 
HIS3 locus, resulting in strain YEZ50 (Supplementary Tables 1, 2). As a control, 
we also transformed Y200 with Pmel-linearized EZ-L158, a vector containing 
Prepj-VP16-EL222-T yc; but lacking Pc¢129-PDC1-T 4px, and then counter- 
selected against the pJLA121-PDC1° plasmid by growing on 5-fluorootic acid 
(5-FOA) (as described below) to produce the control strain YEZ50C. Strain YEZ50 
was then transformed with Pmel-linearized EZ-L143, which inserts multiple copies 
of Pci29-PDC1-T 4pm into 6-integration sites of the yeast genome. Colonies able to 
grow on YPD plates containing 800 1g ml“ of zeocin were replica plated on plates 
containing SC-His with 3% glycerol and 2% ethanol twice. The resulting plates 
were then replica plated on SC-His with 3% glycerol, 2% ethanol and 1mgml"! 
5-FOA, and then finally back onto plates containing SC-His with 3% glycerol and 
2% ethanol. This treatment efficiently counter-selects against pJLA121-PDC1°™, 
due to its URA3 marker”. From this plate, we isolated YEZ61-23, a strain that can 
grow on SC-His with 2% glucose plates only when exposed to blue light, which is 
consistent with having PDC1 expression controlled by Pci29, and VP16-EL222. As 
a control, we also transformed Y200 with Pmel-linearized EZ-L143, which contains 
Pey29-PDC1-T apy but lacks P-rgp7-VP16-EL222-Tcyc;, and counter-selected 
against in pJLA121-PDC1” in 5-FOA to produce YEZ61C. This control strain 
has multiple copies of Pcj29—-PDC1-T pu in 6-sites, but lacks the VP16-EL222 
needed to transcribe them. 

Light-dependent growth of an OptoEXP-PDC strain. Solid media. Cells from 
strains BY4741, YEZ61-23, YEZ50C and YEZ61C were patched onto an agar plate 
containing yeast extract, peptone and 3% glycerol and 2% ethanol (YPGE plate), 
and grown over night in ambient light. This plate was then replica plated onto a 
fresh YPGE plate and two YPD plates. One of the YPD plates was covered in tin foil 
and the other exposed to constant blue light, while the YPGE was left at ambient 
lighting. Replica plating was performed such that the YPD plate ultimately covered 
in tin foil was replicated first, the YPD plate ultimately exposed to constant blue 
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light was replicated second, and the YPGE plate was replicated last. All plates were 
incubated at 30°C for 48 h (Fig. 2b). 

Liquid media. Single colonies of YEZ61-23 and BY4741 were used to inoculate 
liquid SC-His with 2% glucose medium. The cells were grown for 24h on a roller 
drum at 200 r.p.m., 30°C and 40 cm away from a blue light source (HQRP New 
Square 12-inch LED Grow Light System 225 Blue LED 14W). Subsequently, the 
OD was measured with a spectrophotometer, and the cells were diluted to an 
OD600 of 0.1 in fresh SC-His medium supplemented with 2% glucose medium in 
three 1-ml replicates in four 24-well plates (Celltreat non-treated sterile flat bottom 
plates). Each plate was centred 40cm under an HQRP blue light panel. One plate 
was exposed to full blue light. The other three plates were exposed to 10s on and 
70s off of blue light, 20s on and 60s off of blue light or 40s on and 40s off of blue 
light. A fifth plate was wrapped in aluminium foil to incubate cells in the absence 
of light. OD readings were taken using a TECAN plate reader at 0h, 10h, 12.5h, 
13.5h, 15h, 17h, 19h, 20.5h and 31.75h. Readings were taken under minimal 
light conditions to prevent unwanted activation of EL222. Data are available in a 
supplementary spreadsheet. 

The exponential growth phase of YEZ61-23 (identified as the most linear 

portion of the plot of log.(OD) against time), was used to find the specific 
growth rates at different light doses. This was done by fitting the data to 
log-(OD) =loge(ODo) + pu x t, using least squares linear regression, where ODo 
is a constant corresponding to the initial OD, and 1 corresponds to the specific 
growth rate constant. The /1 constants were calculated for each independent experi- 
ment and then averaged, with error bars representing s.d. (n = 3). Source Data are 
available with the online version of this paper. 
Development of chemical production strains. To develop light-dependent 
lactic acid-producing strains, we transformed Y202 with Pmel-linearized EZ-L259, 
EZ-L260 and EZ-L266 (OptoINVRT1, OptoINVRT2 and OptoINVRT3 respec- 
tively) yielding YEZ115, YEZ116 and YEZ117. We then integrated multiple 
copies of Pmel-linearized EZ-L235 (Supplementary Table 1), which contains P¢z29 
driving PDC1 and Pgar) driving the Ldh from Pelodiscus sinensis (provided by J. 
Lee) into 6-sites. Then we counter-selected against pJLA121-PDC1” to produce 
YEZ144 (OptoINVRT1), YEZ145 (OptoINVRT2) and YEZ146 (OptoINVRT3) 
(Supplementary Table 2). These strains induce PDC1 and repress Ldh expression 
in the light; while in the dark they stop inducing PDCI and induce Ldh instead. 

To develop light-dependent isobutanol-producing strains, we transformed 
YEZ115, YEZ116 and YEZ117 with Pmel-linearized EZ-L316, which inte- 
grates multiple copies of Pci29-driven PDCI and Pgaz1-driven ILV2 in genomic 
6-integration sites. We then counter-selected the transformants against pJLA121- 
PDC1° with 5-FOA to produce strains YEZ131 (OptoINVRT1), YEZ149 
(OptoINVRT2) and YEZ133 (OptoINVRT3), respectively. Subsequently, we 
transformed these strains with plasmid EZ-L310 (Supplementary Table 1), which 
contains five genes from the mitochondrial isobutanol biosynthetic pathway: 
ILV2, ILV5, ILV3, COX4-ARO10 and COX4-LIAdhA™" (ref. 25). The last two 
genes in EZ-L310 are fused by their N termini to the mitochondrial localization 
signal of COX4, ensuring that all five genes are targeted to the mitochondria”. In 
addition, Pgaz; drives ILV2 expression, which places this gene under the control 
of the OptoINVRT circuits. The resulting strains are YEZ159, YEZ156 and HPY6 
for OptoINVRT1, OptoINVRT2 and OptoINVRT3, respectively (Supplementary 
Table 2). 

Screens for lactic acid- and isobutanol-producing strains. Colonies from each 
transformation plate (grown in glucose and under blue light) were screened for 
lactic acid (7 colonies from each of YEZ144, YEZ145 and YEZ146) or isobutanol 
(12 colonies from each of YEZ159, YEZ156 and HPY6) production (Supplementary 
Table 2). Each colony was used to inoculate 1 ml SC-His medium supplemented 
with 2% glucose (for lactic acid-producing strains) or SC-Ura medium supple- 
mented with 2% glucose (for isobutanol-producing strains) in 24-well plates and 
grown overnight at 30°C, 200r.p.m. and blue light. The next morning, each culture 
was diluted to an ODgoo of 0.1 for lactic acid-producing strains, or an OD¢o9 of 0.15 
for isobutanol-producing strains, in fresh medium, and grown for 12h (for lactic 
acid production) or 18h (for isobutanol production), at 30°C, 200 r.p.m. and under 
blue light. After these incubation periods, the cultures reached OD¢o0 values of 5 
(for lactic acid-producing strains, =5) and 5 and 8 (for isobutanol-producing 
strains, p= 8, p=5); at which point they were moved into the dark for 6h for 
lactic acid-producing strains (9 = 6h) and 3h for isobutanol-producing strains 
(@=3h). The cultures were then centrifuged and re-suspended in fresh medium, 
in plates that were then sealed with Nunc Sealing Tape (Thermo Fisher Scientific) 
to begin the fermentations. The plates were incubated in the dark at 30°C and 
shaken at 200 r.p.m. for 48h during fermentation. Subsequently, the cultures were 
centrifuged, and the supernatants collected for high-performance liquid chroma- 
tography (HPLC) analysis. For lactic acid-producing strains, most of the seven 
colonies screened for each OptoINVRT circuit produced similar amounts of lactic 
acid, with colonies of strain YEZ145 (containing OptoINVRT2) producing the 


highest titres (Extended Data Fig. 4c). On the other hand, out of the 12 colonies 
screened for each isobutanol-producing strain, only 2 or 3 produced high titres for 
each OptoINVRT circuit, with YEZ159 (containing OptoINVRT1) producing the 
highest titres (Extended Data Fig. 4d, e). 

Construction of a high isobutanol-producing strain. We deleted BAT1 from 
YEZ131 via homologous recombination, using the HygB hygromycin-resistance 
marker, resulting in strain YEZ158. Subsequently, we transformed YEZ158 with 
EZ-L310, resulting in transformants YEZ167, from which we screened 7 colonies, 
as described above, and identified YEZ167-4 as the strain with highest isobutanol 
production. 

Optimizing experimental parameters for light-dependent fermentation. The 
colonies producing the highest titres from YEZ144, YEZ145 and YEZ146 (for lactic 
acid) and of YEZ167 (specifically YEZ167-4 for isobutanol), were used to optimize 
the pre-growth parameters of fermentation for lactic acid or isobutanol production. 
For each strain, an overnight culture was grown in blue light at 30°C with shaking 
at 200r.p.m., in SC medium supplemented with 2% glucose (SC-His for lactic 
acid-producing strains and SC-Ura for isobutanol-producing strains). To find the 
optimal cell density at which to switch cultures from light to dark, we diluted 
the overnight cultures into 1 ml SC-dropout medium to different OD¢00 values, 
ranging from 0.04 to 0.32. The lactic acid-producing strains were then grown for 
16h under 15s on and 65s off blue light cycles. The isobutanol-producing strains 
were grown for 18h under 15s on and 65s off blue light cycles. We then measured 
the ODgoo of each culture (these values correspond to variations in p), and incu- 
bated them in the dark for 6h for lactic acid-producing strains (@=6h) and 3h for 
isobutanol-producing strains (@= 3h). After this dark incubation period, the 
cultures were centrifuged at 234g for 5 min and suspended in fresh SC-dropout 
medium containing glucose at 26.5 gl’ (for lactic-acid-producing strains) or 
21.5 gl"! (for isobutanol-producing strains). The plates were sealed with Nunc 
Sealing Tape, and incubated in the dark for fermentation at 30°C and 200r.p.m. 
Control cultures were grown under 15s on and 65s off blue light during the growth 
phase, the dark incubation period, and the fermentation. Cultures producing lactic 
acid were harvested after 48h, while samples of cultures producing isobutanol 
were taken after 24, 48 and 72h. Cultures were centrifuged at 234g for 10 min, and 
supernatants were analysed with HPLC. 

To optimize the dark incubation period immediately before fermentation (0), 
the best isobutanol-producing strain, YEZ167-4, was grown overnight under 
15s on 65s off blue light in SC-Ura medium supplemented with 2% glucose. The 
overnight culture was then diluted into seven different plates in quadruplicate 
samples in fresh medium to a starting ODgoo of 0.1. The cultures were then grown 
to an OD600 of 8.5 (which was found to be the optimal OD¢o9 in our previous 
experiment). At that point, the plates were tin foiled to ensure complete darkness. 
After every hour, one of the plates was centrifuged, and the cells suspended in 
fresh SC-Ura medium with 20.8 g1~! glucose medium and subjected to a 48-h 
fermentation in the dark. 

Optimizing the frequency of light bouts during the production phase of batch 
fermentation. A single colony of the best isobutanol-producing strain, YEZ167-4, 
was used to inoculate 5 ml SC-Ura medium supplemented with 4% glucose and 
grown overnight under light. The next morning, the culture was diluted in 1 ml 
fresh medium to an ODgo0 of 0.2 (in quadruplicates) and grown under light for 20h 
to an ODg00 of 9.5. Subsequently, the cultures were incubated for 3h in the dark. To 
start the fermentations, the cultures were centrifuged again, and suspended in fresh 
SC-Ura medium supplemented with 15% glucose (precisely 157.0 g~! glucose, as 
measured with HPLC), and kept in the dark. During the fermentation, the cultures 
were first exposed to 4h of light and then pulsed every 5, 10, or 20h for 30 min, at 
a duty cycle of 15s on and 65s off. As controls, some plates were always kept in the 
dark or in full light. Fermentations lasted for 80h, after which the cultures were 
centrifuged, and the supernatants were analysed with HPLC. 

NAD*/NADH ratio measurements. YEZ167-4 was inoculated into 3 ml SC-Ura 
medium supplemented with 2% glucose and grown overnight under light at 30°C 
and 200r.p.m. The next morning, the culture was diluted into 1 ml fresh medium in 
6 different 24-well plates to an OD¢o9 of 0.2 (in quadruplicates) and grown at 30°C 
and shaken at 200r.p.m. under light for 20h to an OD¢o9 of 9.5. Subsequently, the 
cultures were incubated under the same conditions for 3h in the dark. The cultures 
were then centrifuged at 234g for 5 min and resuspended in fresh SC-Ura medium 
supplemented with 15% glucose (precisely 157.0 gl! glucose, as measured with 
HPLC), and kept in the dark. As controls, one plate was always kept in the dark 
and one plate was kept in full light. The remaining four plates were pulsed every 
10h for 30 min, at a duty cycle of 15s on and 65s off. One plate was exposed to an 
extra 30-min pulse of light at a duty cycle of 15s on and 65: off precisely 4h before 
harvest and placed back into the dark for the remainder of the fermentation. One 
plate was exposed to an extra 30-min pulse of light at a duty cycle of 15s on and 
65s off precisely 2h before harvest and placed back into the dark for the remainder 
of the fermentation. One plate was exposed to an extra 30-min pulse of light at 
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a duty cycle of 15s on and 65s off precisely 30 min before harvesting to test how 
the light pulses affect the NADt/NADH ratio. After 48h, cells were harvested by 
centrifuging at 9,000g for 5 min and removing all supernatant. The NAD*/NADH 
ratios of the pelleted cells were measured as previously described®’. 
Light-dependent growth in a 2-1 bioreactor. To test whether light penetration 
becomes limiting to a strain with light-dependent growth, we used a 2-1 photo- 
bioreactor to compare the growth of YEZ167-4 (which is an isobutanol-producing 
strain with light-dependent growth) to that of YZy335 (which is a strain that con- 
stitutively produces isobutanol and is PDC‘, and thus its growth and isobutanol 
production are independent of light). 

We used a single colony of YEZ167-4 or YZy335 to inoculate 5 ml SC-Ura 
medium supplemented with 2% glucose under light, overnight. The next morning, 
we diluted the culture in 0.5 1 SC-Ura medium supplemented with 15% glucose to 
an OD6¢o0 of 0.1 ina UTEX 2-1 glass photobioreactor surrounded by three blue light 
panels, placed 1 cm from its glass wall (Extended Data Fig. 9a, b), instead of the 
UTEX LED lighting platform. The culture was grown for 84h at 30°C and under 
constant blue illumination, with cells gently mixed with a magnetic stir bar and air 
sparging. Samples were taken every 12h to measure the OD¢oo of the cell cultures 
(Fig. 3d). We report the average measurements of three independent fermentations 
for each strain, and the error bars correspond to the standard deviations of those 
three measurements. 

Fed-batch fermentation. To test the capability of these systems to utilize higher 
amounts of glucose in a laboratory-scale fermenter, we conducted fed-batch experi- 
ments using the Sixfors INFORS AG CH-4103 Boltmingen/Switzerland based on a 
previously described protocol. We did not sparge air to keep the fermentation in 
microaerobic conditions. The pH was set to 5.5 and maintained with a 0.5 M KOH 
feed. We autoclaved the 500-ml fermenter with 250 ml ddH,0 and exchanged the 
ddH,0 (using the air pump) with filtered 250 ml SC-Ura medium supplemented 
with 10% glucose. We used a single colony of YEZ167-4 to inoculate 5 ml SC-Ura 
medium supplemented with 2% glucose under light, overnight. The next day, we 
added the inoculum to the fermenter to obtain an OD¢09 of 0.2. The fermentation 
was kept at 30°C and mixed at 200 r.p.m. Two light panels were placed 0.5cm 
away from the vessel walls (Extended Data Fig. 9c, d), and kept on for the first 
46 h (growth phase of the fermentation), in which the culture grew to an OD¢o0 of 
8.2 +0.1. The culture was then covered with black cloth and the light was turned 
off for 4h. We then started a glucose feed of SC-Ura medium supplemented with 
50% glucose at a rate of 2.9 mlh™! for 90h (run 1) or 120h (runs 2 and 3) and 
turned on the light (continuously) for the first 4h of the glucose feed. After the 
initial 4h, the light was switched to a duty cycle of 45 min on and 7h 15 min off for 
the remainder of the fermentation (production phase). Samples of 1 ml were taken 
at hours 0, 24, 48, 72, 96, 120, 144, 168, 192, 216, 240, 264 (after inoculation), for 
both OD¢o9 measurements, and HPLC analysis. Samples collected at 48, 96, 144, 
192 and 264h were also analysed with qPCR. 

Analysis of chemical concentrations. The concentrations of glucose, lactic acid, 
ethanol, isobutanol and 2-methyl-1-butanol were quantified with HPLC, using an 
Agilent 1260 Infinity instrument (Agilent Technologies). Samples were centrifuged 
to remove cells and other solid debris at 234g for 10 min, and the supernatants were 
then centrifuged for 30 min at 17,000g and 4°C prior to loading onto an Aminex 
HPX-87H ion-exchange column (Bio-Rad). The column was eluted with a mobile 
phase of 5 mM sulfuric acid at 55°C and a flow rate of 0.6 ml min“!. Glucose, 
lactic acid, ethanol, isobutanol, and 2-methyl-1-butanol were monitored with a 
refractive index detector (RID). To determine their concentration, the peak areas 
were measured and compared to those of standard solutions for quantification. 
Quantification of gene expression and 6-integration copy number. To measure 
the number of copies of P¢j2>—PDC1 integrated into the genome of YEZ50lost, 
YEZ61-23, YEZ144, YEZ145, YEZ146 and YEZ167-4, we ran qPCR experiments on 
their genomic DNA, extracted with phenol-chloroform*. As a control strain con- 
taining a single copy of PDCI, we developed YEZ5Olost by counter-selecting against 
plasmid pJLA121-PDC1™” from strain YEZ50, using 5-FOA as described above. 

To quantify the expression of PDCI and ILV2 during the fed-batch fermentation 
experiments, we measured transcript levels using qPCR. We collected-5 ml samples 
over the course of the fermentation and centrifuged them at 900g for 1.5 min. The 
mRNA was then extracted with phenol-chloroform*. Concentrations of samples 
were measured using the absorbance at 260 nm/280 nm. The QuantiTect Reverse 
Transcription Kit (Qiagen) was used to remove genomic DNA and synthesize cDNA. 

qPCR was performed using the Bio-Rad Mini Opticon and the iTaq Universal 
SYBR Green Supermix (Bio-Rad). Reactions contained the appropriate amounts of 
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reagents as specified by kit protocols. Threshold cycle values were set by StepOne 
software (Thermo Fisher Scientific) and melting curve analysis following amplifi- 
cation was used to verify the quantity of single PCR products. Primers specific for 
PDCI, ILV2 and ACT] are listed in Supplementary Table 7. The expression level of 
each gene was normalized to that of ACT! from the same samples. 

Testing gene expression of OptoEXP in 5-1 fermenter and higher cell densities. 
To test OptoEXP in higher cell density conditions, we inoculated YEZ243 into 
SC-His medium supplemented with 3% glycerol and 2% ethanol and grew in the 
dark for 24h. We then set up a BioFlo120 system with a 5-1 bioreactor (Eppendorf, 
B120110002) and added 2.41 of SC-His medium supplemented with 3% glycerol 
and 2% ethanol after autoclaving. The reactor was set to 30°C, pH of 6.5, anda 
minimum dissolved oxygen percentage of 40. Blue LED strips (Shenzhen Shinesky 
Optoelectronics, SMD3528) were wrapped around the reactor, covering 73% of 
the available bulk surface area of the fermentation (Extended Data Fig. 10a, b). 
These LED strips emitted light at an intensity of 129 1molesm ~*~! (with 465 nm 
max peak spectra) measured in the same condition as the light panels. The reactor 
was then inoculated to an ODgo9 of 1 and the cells were grown in the dark until an 
OD600 of 15 (maintained by covering the reactor with black fabric). At an OD¢00 
of 15, the lights were turned on and samples were taken at an OD6o9 of 16 (1h 
after induction), 19 (6h after induction), 41 (24h after induction), 46 (32h after 
induction) and 50 (40h after induction). Cells were fixed by diluting them 
to an ODgo9 of 1 in SC-His medium with 3% glycerol and 2% ethanol, adding 
paraformaldehyde to 3.7%, and incubating for 1h at 25°C. To prepare samples for 
flow cytometry, cells were washed twice with Dulbecco’s PBS, and re-suspended 
again in Dulbecco's PBS to an ODg09 of 0.5. 


Statistics. Statistical significance was determined using a standard t-test for P values. T 


(meancondition 2 — MeaNcondition 1),{number of samples 


s.d. condition 2 

P values were calculated using a degree of freedom of 2 and a one-sided f-test 
calculator. 

Data availability. The authors declare that all data supporting the findings of this 
study are available within the paper (and its Supplementary Information files), 
but original data that supports the findings are available from the corresponding 
authors upon reasonable request. 
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pYZ12-B lox66 


Extended Data Figure 1 | Maps of key vectors used in this study. transformation. c, The general vector map shows the relative orientation 

a, pYZ12-B vector used to integrate genes or circuits into the HIS3 of the five positions listed in Supplementary Table 1, in which different 
locus. Constructs are usually transferred from pJLA vectors using XmaI genes (including promoters and terminators) were assembled, using a 

and Ascl sites. Final constructs are linearized with Pmel before yeast previously described multiple gene insertion strategy**. All vectors have an 
transformation. b, pYZ23 vector used to integrate genes or circuits into 6-5 — ampicillin-resistance marker (AMPR) for cloning in E. coli and a selection 
sites of yeast. Constructs are usually transferred from pJLA vectors using marker for S. cerevisiae (Marker). Vector types include CEN/ARS, 21, or 


Mrel and AscI sites. Final constructs are linearized with Pmel before yeast _integrative***?. 
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Extended Data Figure 2 | Flow cytometry of strain with GFP controlled 
by OptoEXP. Representative flow cytometry graphs from three biological 
replicates under the same conditions. Experiments were performed in 
24-well plates, in either glucose or glycerol and ethanol. Every graph 

is generated from 20,000 cells. a, Control strain (YEZ186) with GFP 
under P yg; exposed for 3h to constant blue light (magenta) or kept for 
the same amount of time in the dark (cyan); these samples are almost 
completely superimposed on the right-hand side of graph. Control strain 
(YEZ140) with no GFP exposed for 3h of constant blue light (red) or kept 
for the same amount of time in the dark (grey); these samples are almost 
completely superimposed on the left-hand side of graph. Under these 
conditions, there is no detectable photobleaching. b, Light-induced GFP 
expression in YEZ139, a strain with GFP controlled by OptoEXP. GFP 
expression in YEZ139 in SC-His medium supplemented with 2% glucose 
after 3h of exposure to blue light (green) is homogeneous across the cell 
population, and 37-fold higher than in YEZ139 cells kept in the dark for 
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the same amount of time (orange). The maximum level of GFP 
expression obtained by OptoEXP in YEZ139 grown in full light for 

3h (green) is 22.1% of what is achieved in YEZ186, which contains 
Prer;-GFP, grown under the same conditions (magenta). Fluorescence 
from a control wild-type strain without GFP, YEZ140, grown for 3h in the 
light, is shown for comparison (red). c, Light-induced GFP expression by 
OptoEXP in YEZ243 in SC-His medium supplemented with 3% glycerol 
and 2% ethanol. Starting from cultures grown in the dark, samples were 
taken (in the exponential growth phase, at an ODg0o of approximately 3) 
and incubated in 24-well plates under the following light conditions: 2h 
in the dark (orange); 1.5h in the dark followed by 30 min in light (green); 
1h in the dark, followed by 1h in light (brown); or 2h in light (dark blue). 
YEZ140 (red) and YEZ186 (magenta) were used as controls with no GFP 
expression and GFP expression from a strong, constitutive promoter 
(Prer1), respectively. d, Example of the gating used to make the flow 
cytometry plots in a—c. All experiments were repeated at least three times. 
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Extended Data Figure 3 | OptoEXP performance under LED panels 
of different intensities. Light-induced expression of GFP in YEZ243 
(OptoEXP driving GFP) compared to constitutive GFP expression in 
YEZ186 (Preri-GFP), using light panels of different intensities. Pulsed 
light was applied at duty cycles of 10s on and 70s off. Data are shown as 


mean values; dots represent individual data points; error bars represent the 


s.d. from four biologically independent 1-ml culture sample replicates. 
All experiments were repeated at least three times. 
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Extended Data Figure 4 | Comparison of OptoINVRT light-repressible 
transcription circuits. a, OptoINVRT circuit design, based on the 
expression of Gal80 from OptoEXP and of Gal4 from constitutive 
promoters of different strength, with or without a PSD domain. b, Genes 
controlled by OptoINVRT circuits are repressed in the light and activated 
in the dark by the repression activity of Gal80 on Gal4 transcription factor. 
The PSD fused to Gal4 in OptoINVRT3 stimulates protein degradation 
in the light. c, Screens for lactic acid production in 2% glucose of several 
colonies of strains YEZ144 (OptoINVRT1), YEZ145 (OptoINVRT2) 

and YEZ146 (OptoINVRT3), using growth parameters: p=5 and 

6=6h, where p is the cell density at which cells are moved from light 


to dark and 0 is the time cells are incubated in the dark before starting 

the fermentation (n= 7 biologically independent colonies). d, Screens 

for isobutanol production in 2% glucose of several colonies of YEZ159 
(OptoINVRT1), YEZ156 (OptoINVRT2) and HPY6 (OptoINVRT3), using 
growth parameters: p = 8 and 0= 3h (n= 12 biologically independent 
colonies). e, Screens for isobutanol production in 2% glucose of several 
colonies of YEZ159 (OptoINVRT1), YEZ156 (OptoINVRT2) and HPY6 
(OptoINVRT3), using growth parameters: p=5 and 0=3h (n= 12 
biologically independent colonies). The screens shown in c-e were 
performed once in our laboratory. 
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Extended Data Figure 5 | Light-controlled lactic acid production. 

a, Lactic acid is produced by the reduction of pyruvate by Ldh. PDC1 

is controlled by OptoEXP and Ldh by OptoINVRT circuits. b, With 
optogenetic controls, light can be used to separate fermentation into two 
phases: a growth phase when cultures are exposed to light, during which 
PDC1 is expressed and Ldh is repressed, and a lactic acid production 
phase when cells are in the dark, during which PDC1 is not induced, and 
Ldh is expressed. c, Experimental design for the screening of strains and 
optimization of conditions. p and 0 were varied in these experiments. 

d, Three OptoINVRT circuits were tested for lactic acid production: 
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OptoINVRT1 (YEZ144); OptoINVRT2 (YEZ145) and OptoINVRT3 
(YEZ146). Top, dependence of lactic acid titres on p. Bottom, dependence 
of the ratio of lactic acid to ethanol on p. Fermentations were done in 
26.5 g1! glucose and run for 2 days. All samples had @ = 6h. Data are 
shown as mean values; dots represent individual data points; error bars 
represent the s.d. of three biologically independent 1-ml culture sample 
replicates exposed to the same light conditions. *P < 0.05, **P< 0.01, 
***P < (0.001. Statistics are derived using a one-sided t-test. All 
experiments were repeated at least three times. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a 
8 
6 
8 
p 
So 
FS 4 
% 8 
wn 
a} 
a 
8 
2 
0 
YEZSOlost YEZ61-23 YEZ144 YEZ145 YEZ146 YEZ167-4 
b Yeast Strain 
= —e— PoC! -—e— iLv2 ane 
& 600 
i= 
9 = 
2 Oo 
a 
: : 
1S) 
a 400 g 
g ‘ 
Z 0) 
i 8 
o ° 
= =] 
cS 
200 
0 
48 96 144 192 264 
hours after innoculation 
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genomic DNA samples (see Methods). All strains have one copy of PDC1 production in 0.5-1 fermenters. qPCR was performed on samples from 
integrated in the HIS3 locus, and the rest are integrated in random fed-batch fermentations for isobutanol production (Fig. 3e) to measure 
5-integration sites (except YEZ50lost, which only has one copy in the HIS3 _ concentrations of PDC1 and ILV2 transcripts. Gene expression was 
locus). Data are shown as mean values; dots represent individual data normalized to ACT] transcripts. Lines represent average values from 
points; error bars represent the s.d. from three biologically independent samples taken from two biologically independent fermentations run under 
1-ml culture sample replicates. All experiments were repeated at least the same conditions. All experiments were repeated at least two times. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


° 
a 08 
2% glucose 

=~ fermentation 
-! 

— 

io) 

2 

ne 

© 

4 

= 04 

[e} 

Cc 

iv] 

£ 

a 

we} 

fo} 

2 


p (OD at switch to dark) 


Extended Data Figure 7 | Optimization of light-controlled isobutanol 
production. a, Dependence of isobutanol titres on p. Cells were grown 
with 0 = 3h; fermentations were done in 21.5 g1~' glucose; isobutanol 
titres were measured after 2 days of fermentation in the dark. YZy335 is 

a control strain with a constitutive isobutanol pathway plasmid and wild- 
type PDC1, PDC5 and PDCé6 and was used in 2-day fermentations at high 
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cell density as a control. b, Dependence of isobutanol titres on 0. Cells 
were grown to p = 8.5. Fermentations were again done in 20.8 g1~' glucose 
for 2 days in the dark. All data are shown as mean values; dots represent 
individual data points; error bars represent the s.d. of three biologically 
independent 1-ml culture sample replicates exposed to the same light 
conditions. All experiments were repeated at least three times. 
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Extended Data Figure 8 | Optimization of high glucose fermentations. 
a, Glucose remainders as a percentage of initial glucose concentration 
after 48h (20g1-’ initial glucose) or 80h (150 g1! initial glucose) of 
fermentation of YEZ167-4 in the dark. Cell growth parameters: p = 8.5 and 
6=4h (for fermentations in 20 g1~! glucose) and p=9.5 and 0=3h (for 
fermentations in 150 gl“! glucose). Data are shown as mean values; dots 
represent individual data points; error bars represent the s.d. from three 
biologically independent 1-ml culture sample replicates. All experiments 
were repeated at least three times. b, NAD*/NADH ratio recovery through 
light pulsing. NAD*/NADH ratios were measured in samples under 
similar batch fermentation conditions as shown in Fig. 3c (see Methods). 
YEZ167-4 cultures were diluted into six 24-well plates and grown to an 
OD6o00 of 9.5 and left in the dark for 3h before resuspending cells in 

15% glucose medium. Four of the plates were pulsed every 10h for 

30 min at a duty cycle of 15s on and 65s off. Cells were harvested 

after 48-h fermentations, and at different times after the last light pulse 

(0, 1.5, 3.5, or 7.5h). Control plates were kept under full light or in the dark 
throughout the 48-h fermentations. NAD*/NADH ratios in pelleted cells 
were measured following a previously described method®. Data are shown 
as mean values; dots represent individual data points; error bars represent 
the s.d. of four biologically independent 1-ml culture sample replicates. 

All experiments were repeated at least three times. 
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Extended Data Figure 9 | Diagrams of light-stimulated laboratory-scale 
fermenters used to test YEZ167-4. a, b, The 2-1 bioreactor was set up so 
that three light panels could be placed around the fermenter. A magnetic 
stir plate and stir bar were used to mix the culture, and fermentations were 


performed in a 30°C warm-room. ¢, d, The 500-ml fed-batch bioreactor 
was set up so that two light panels could be placed around the fermenter. 
The culture was mixed with a motorized propeller and a heat plate with 
temperature control probe was used to maintain the temperature at 30°C. 


© 2018 Macmillan Publishers Limited, part of Springer Nature. All rights reserved. 


LETTER 


a b 
© 
Q 
3 
Ee 
Oo 
for 
N 
= 
(5) 
= © 
Q 
= 
19. cm 
c 
400 YEZ140, 
no GFP Control 
— ODd=15 
( ) 
80 ~— Preinduction 
© OD = 16 
2 1 hour 
£ 60 
[= 
8 
5 @ OD =19 
a 
mo] 
g 6 hours 
o 
E 40 
2 @ oD =41 
24 hours 
20 @ OD = 46 
32 hours 
OD = 50 
0 40 hours 
0 10° 1 of 1 0° 
GFP Signal 
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Cells were grown in fed-batch mode using a glycerol feed to achieve the which are very similar, are available upon request. All experiments were 


highest cell densities possible in this setup. Yeast cells were exposed to light _ repeated at least three times. 
when they reached an ODgoo of 15 and left under continuous illumination 
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CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature25499 


Erratum: Non-adaptive plasticity 
potentiates rapid adaptive 
evolution of gene expression in 


nature 


Cameron K. Ghalambor, Kim L. Hoke, Emily W. Ruell, 
Eva K. Fischer, David N. Reznick & Kimberly A. Hughes 


Nature 525, 372-375 (2015); doi:10.1038/nature15256 


There are several errors in this Letter. First, owing to a production error, 
the Sequence Reads Archive (SRA) accession number is published as 
‘SRP06236’ rather than ‘SRP062364. Second, owing to an author error, 
the sample size for the number of samples for the low-predation (LP) 
population reared without predators is listed in the Methods as ‘n = 3, 
rather than ‘n= 4 Third, owing to an author error, in Supplementary 
Table 1, the treatment columns referred to as ‘NP’ and ‘P’ are reversed 
for the HP population. Furthermore, we have identified an error in 
the code used to produce some of the simulations. Permutation distri- 
butions used to identify concordantly differentially expressed (CDE) 
genes inadvertently omitted data from the LP population. We have 
now reanalysed the data, including that population. In this revised 
analysis, all of the originally reported patterns remain unchanged, but 
the magnitude of statistical significance is reduced compared to the 
original analysis. The main results and conclusions remain unchanged, 
but because the reanalysis alters the set of genes deemed CDE, all sub- 
sequent tests based on the subset of CDE genes also required reanalysis. 
The changes to the original text, tables and figures reported in this 
Letter are described in more detail below. 

(1) The number of adaptively evolved (CDE) genes decreased from 
125 to 94, and remains outside the interquartile range for the permu- 
tated distribution (23, 73), and in the upper decile of the permuted 
distribution. 

(2) The correlation between evolutionary change and non-adaptive 
plasticity (Fig. 2 of the original Letter) remains almost unchanged 
(—0.81 in the new analysis, and —0.82 in the original analysis). 
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As before, the new correlation remains well outside the interquartile 
range of the permuted correlations (—0.47, —0.59), and smaller than 
any of the 1,000 permutation values. See Supplementary Fig. 1 to this 
Erratum. 

(3) The correlation between ancestral and derived plasticity (Fig. 3 
of this Letter) has increased from —0.588 in the original analysis to 
—0.743 in the new analysis. As before, the new correlation remains 
well outside the interquartile range of the permuted correlations 
(—0.57, —0.68), and in the lower 3.5 % of the permuted values. 
See Supplementary Fig. 2 to this Erratum. 

(4) The results comparing the observed amount of divergence in 
gene expression versus the amount of putative neutral divergence 
reported has remained almost unchanged (Extended Data Table 1 of 
the original Letter). The new Per values for ‘Introl’ and ‘Intro2’ are 0.33 
and 0.25 versus the original Psr values of 0.32 and 0.27, respectively. All 
other results remain the same. Supplementary Table 1 to this Erratum 
contains the corrected version of the original Extended Data Table 1 
that shows the comparison of gene expression divergence (Psy) with 
divergence of putatively neutral microsatellite loci (Fer). 

(5) The number of significantly plastic genes increased from 565 to 
658 (in Supplementary Table 2 and Extended Data Fig. 4 of the original 
Letter). As before, most of these transcripts exhibit plastic changes 
within the ancestral population that are in the opposite direction of 
population divergence: 67% (439 out of 658) versus 75% (424 out of 
565) in the original analysis. The observation of 658 significantly plastic 
genes is well outside the interquartile range and in the lower 4% of 
permuted values. See Supplementary Fig. 3 to this Erratum. 

The permutation tests used in this study are critical in determining 
whether the observed patterns lie outside the range of expected values 
given the structure of the data. The inclusion of the naturally occurring 
LP population in the reanalysis presented here represents a more con- 
servative test of the occurrence of adaptively evolved CDE genes, and 
as a result weakens the magnitude—but not the direction—of some of 
the observed patterns. In summary, support for the main results and 
conclusions remains robust and our interpretation of the data remains 
unchanged. The original Letter has not been corrected online. 


Supplementary Information is available in the online version of this Erratum. 
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NASA scientist Josh Willis (left), who has also done stand-up comedy, celebrates success on a mission he jokingly called OMG (Oceans Melting Greenland). 


How laughter in the lab can 
improve your science 


Science is serious business — but it also shows humour can be a great thing for a team. 


BY AMBER DANCE 


he lab is sometimes a silly place — and 
Tretia it should be. A group that 

behaves in daft ways from time to time 
tends to be one that is positive, results-oriented 
and successful, says Michael Kerr, a business 
speaker in Canmore, Canada, who focuses on 
humour in the workplace. Jokes and pranks 
can serve as signs of a healthy workplace, 
and provide ways to foster trust and good 


communication among staff, Kerr says. 

A 2012 meta-review of studies on humour 
in the workplace found that it is linked with 
strong employee performance, effective 
stress-coping mechanisms and sturdy group 
cohesiveness’. The study, conducted by two 
management researchers at the University of 
North Carolina, Wilmington, and a psycholo- 
gist at Florida International University in 
Miami, also linked humour to reduced burn- 
out among employees. 


Although humour has its benefits, research- 
ers caution that jokes and laughs must be 
appropriate for the workplace and lab mem- 
bers should avoid making fun of each other 
in potentially or clearly hurtful ways. “Make 
sure youre not harassing somebody or singling 
someone out,’ says Sophie Scott, a cognitive 
neuroscientist at University College London. 
“Banter can be bullying” And simply saying 
“T was joking” doesn’t undo the hurt, she adds. 

As any comedian knows, attempts at 
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> humour can fall flat or even backfire. Adam 
Ruben, a molecular biologist at Sanaria, a bio- 
technology company in Rockville, Maryland, 
worries that humour could ruin a younger 
scientist’s chances of being taken seriously. 
Ruben does stand-up comedy on the side, but 
keeps his major scientific talks mostly free 
of jokes. 

Newcomers to a lab group should get to 
know their labmates and principal 
investigator well before they start joking 
around, advises Bernie Chun Nam Mak, 
a lecturer at Hong Kong Baptist Univer- 
sity who studied workplace strategies, 
including humour, during his PhD pro- 
gramme in applied English linguistics at 
the Chinese University of Hong Kong. 


BURST THE STRESS BUBBLE 

There's no shortage of hilarity in the lab 
where James Utterback, a PhD student 
in physical chemistry at the University 
of Colorado Boulder, works. His greatest 
prank (so far — labmates beware) was 
inspired by a laser system that arrived in 
late 2015 for the group’s studies of solar 
photochemistry. It came in a crate with 
metres and metres of bubble wrap. 

While another group was in a meeting, 
Utterback and his accomplices coated 
their student office’s floor, desks and 
printer with the bubbly sheets. They 
hid in the office so that they could catch 
their colleagues’ shocked reaction. Both 
groups laughed and entertained them- 
selves popping the plastic bubbles. 

But that wasn’t the last of it. When 
Utterback returned to his office after a 
conference in mid-2017, he discovered 
hed been replaced — with a bubble-wrap 
mannequin, complete with a wig and 
clothes. “He had been named James 2,” 
says Utterback. “He became kind of like 
our group mascot.’ 

James 2 regularly rotated between the 
lab’s student offices, surprising people 
who turned on the lights to find him diligently 
‘working’ at their desks. 

“Working with James [1] was seriously 
delightful” says Amanda Grennell, a free- 
lance science writer in Missoula, Montana, 
who earned her PhD in August 2017 from 
the same lab. “Pranks gave my brain a much- 
needed break from both work and stress” 

Graduate studies and science in general 
can be frustrating and isolating, agrees Jorge 
Cham of Los Angeles, California, who earned 
a PhD in mechanical engineering from Stan- 
ford University in California. He started a PhD 
comic strip, ‘Piled Higher and Deeper’ soon 
after beginning graduate school, as a sort of 
art therapy to cope with the pressure cooker 
of academic training. 

After a stint as an instructor at the California 
Institute of Technology in Pasadena, he 
switched in 2005 to cartooning full-time. He 


regularly lectures about the misery of graduate 
studies and the joys of procrastination. 

This unusual career path has given Cham 
more fame than he probably ever would 
have earned in academic robotics — his 
original plan. The comics, which have drawn 
188,000 followers on Twitter, have appeared 
in more than 50 US newspapers and been 
collated into 5 collections that have collec- 


Helena Gonzalez teaches science through comedy. 


tively sold more than 100,000 copies. They've 
helped many a stressed graduate student 
laugh through tough times, simply by show- 
ing readers that they're not alone in their 
academic struggles. “Burnout is so common, 
anything that can be done to help people keep 
perspective and enjoy what they’re doing, the 
better the long-term prospects of the lab are,” 
says Cham. 

A light attitude can also help to diffuse ten- 
sion that arises from failures or errors. Cham 
once spilt a bucket of mildly toxic resin in 
another lab that had a group leader he found 
particularly intimidating. He was horrified. “I 
vividly remember my life flashing before my 
eyes,’ recalls Cham. “I never thought that was 
something that actually happened.” 

But the postdoc supervising Cham took 
it good-naturedly, simply telling him he was 
allowed to make that mistake — once. That 
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helped Cham to move past the error. 

As that supervisor so deftly illustrated, 
humour can be a powerful tool for leaders. 
The meta-analysis of workplace humour found 
that when those in power were viewed as fun 
and funny, their subordinates performed bet- 
ter and had stronger teamwork. Staff were also 
happier with their jobs and their bosses. And 
a fun, lighthearted lab group might produce 
better science than one that is perpetu- 
ally solemn and serious. Humour, says 
Kerr, is a positive catalyst for think- 
ing creatively. Or, for mathematicians: 
“Ha + Ha = Aha” 

Light-heartedness can also reduce 
embarrassment when inevitable mistakes 
happen in the lab. “Humour can help 
workers, especially superiors, to imply 
something negative to each other ina less 
face-threatening way, says Mak. He once 
observed a situation in which a worker 
made a mistake on an Excel spreadsheet. 
Her supervisor admonished her by mak- 
ing a wry joke about the error, and they 
both laughed it off. 


LEAD WITH LAUGHTER 

A humour-filled lab might not happen 
spontaneously, so leaders might wish to 
actively encourage it. Scott, who studies 
speech and laughter, makes a conscious 
effort to build a happy team. 

But leaders should be wary of forcing 
humour onto junior scientists. “They 
have to feel comfortable first,” says Scott. 
“They have to feel part of a group.” She 
pays particular attention to new lab 
members — especially when they’re 
from other countries — ensuring that 
they feel comfortable, and not left out. If 
they’re laughing and joking, she knows 
that they’re settling in. 

To help foster a positive group 
dynamic, Scott often organizes lab 
social events and includes both personal 
and scientific discussions in casual lab 
meetings over coffee. The personal sharing is 
optional, of course. 

“The demeanour of the principal investi- 
gator will affect the atmosphere of the whole 
meeting,’ she says. “I try to keep a positive and 
friendly mood.” 

It might seem simple for Scott’s group to 
spice their work with humour — they study 
laughter, after all, and their work entails 
developing funny videos to make viewers and 
subjects chuckle. But no one has to be a stand- 
up comedian to infuse some humour into the 
lab. Kerr says that there are plenty of ways to 
make work fun, even for the less comedy- 
inclined. One option he likes is to put up a 
‘humour bulletin board. Lab members can 
contribute funny statistics, research results 
or cartoons. 

Starting group traditions is another way 
to make work enjoyable. Kerr suggests 


IGNACIO IZQUIERDO 


giving out fun awards, such as the ‘Most 
Likely to Overdose on Caffeine Award’. 
Employees feel appreciated, and everyone 
has a good time. 

Ruben recalls a fun tradition from his 
PhD lab at Johns Hopkins University in 
Baltimore, Maryland. Each scientist picked 
an orange test-tube stopper, or septum, 
and drew a little face on it. Once a week, 
theyd take a break to place the stoppers on 
a shaker; whichever stopper stayed on the 
longest before bouncing off won its scientist 
US$1 from each of the other entrants. That 
two-minute break provided a valuable bond- 
ing experience, Ruben says. 

And if one wonders what to joke about, 
“the easiest person to laugh at is yourself”, 
advises Josh Willis, a climate scientist at 
NASA's Jet Propulsion Laboratory in Pasa- 
dena, California, who has trained with 
comedy troupes The Groundlings and Sec- 
ond City. He christened one of his missions 
OMG, an initialism for ‘Oceans Melting 
Greenland. 

Willis is perfectly willing to engage in a 
bit of self-deprecating humour, and one of 
his studies made it easy to do that. In 2006, 
early in his career, he collected data showing 
that, despite the effects of global warming, 
some of the world’s oceans had somehow 
cooled between 2003 and 2005 (ref. 2). 
Willis endured a fair bit of ridicule for this 
‘discovery. Even conservative pundits got 
in on the harassment, using Willis’s paper 
as evidence that left-leaning scientists are 
clueless about climate. 

Willis, trusting his data, took it all in his 
stride. Then, in February 2007, he discovered 
the error hed made. No, the oceans weren't 
getting colder: certain temperature sensors 
had given bad readings. He published a 
correction shortly afterwards’. 

In response — playing on the talk-show 
punditry that Willis had endured — his wife 
gave him a set of business cards imprinted 
with the title ‘Idiot leftist scientist. He still 
carries the cards in his wallet. “That dose 
of humility and making fun of myself — in 
the long run, I think it benefited my career,’ 
says Willis. 


TREAD WITH CARE 

Some joke topics are simply not acceptable 
in any workplace, notes Nicki Fuchs, a 
stand-up comic and biochemical engi- 
neer at MedImmune in Gaithersburg, 
Maryland. Gender, politics, race and religion, 
among other matters, are off the table, she 
advises. 

Those rules still leave her with plenty of 
room to jest with her labmates. And it’s use- 
ful for her to joke about work, says Fuchs, 
because it helps her, a 30-year-old woman, 
to connect with the rest of the lab members 
— all older men. A recurring wisecrack is 
about whoever most recently left the water 
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running and flooded the lab — a not-uncom- 
mon occurrence, because their work involves 
filling up large, pressurized bottles. 

Jokes may fall flat in some settings. In 
graduate school, Ruben often ended his lab- 
meeting presentations with a joke slide. But 
during one such presentation, his adviser was 
already unimpressed with his scientific pro- 
gress, and Ruben suspects that his joke slide 
—acolleague’s head that was Photoshopped 
onto a goat’s body — deepened his adviser’s 
doubts. Since then, he uses jokes only in 
informal presentations or talks about sci- 
ence careers. 

Scientists should also be careful about 
humour that might not work well with 
people from other cultures. What’s funny 
to one ethnic group can be incomprehen- 
sible or offensive to 
another, Mak notes. 


Humour is a Grae ; 
positive It’s fairly simple 
to learn about the 
catalyst for sense of humour 
thinking in a geographical 
creatively. region to which 
Or, for or one is travelling or 
mathematicians, moving, adds Kerr: 


“Ha+Ha=Aha.” he just Googles it. 


Of course, some 
topics that scientists study — cancer, for 
example — arent funny at all. Yet jokes can 
help to ease tension and discomfort around 
specialities that deal with tragic subjects, 
says Helena Gonzalez, who earned a PhD 
studying the epigenetics of cancer in 2013 
and is now a science communicator with 
the comedy troupe Big Van Science in Bar- 
celona. “That kind of black humour releases 
your feelings and makes your work much 
easier,’ she says. 

Still, she adds, scientists need to be care- 
ful about where they make any such jokes. 
Generally, among a few close colleagues in 
the lab, it’s fine. When dealing with patients 
or the public, it’s not. 

And although pranks are fun, those that 
might endanger personnel, equipment or 
experiments have no place in the lab. Ruben 
recalls one supervisor he had at a summer 
job, who dropped a lit match into a recently 
emptied jug of ethanol. “A column of fire shot 
up to the ceiling” Ruben says. “He probably 
shouldn't have done that.” 

That said, everyone can use a chuckle now 
and then. If you're planning a — harmless! 
— April Fool’s lab jape, be sure to share it with 
@naturejobs, hashtag #AprilFools. = 


Amber Dance is a freelance writer in Los 
Angeles, California. 
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MENTAL HEALTH 
Degrees and depression 


PhD and master’s students worldwide 
report rates of depression and anxiety 
that are six times higher than those in the 
general public (T. M. Evans et al. Nature 
Biotech. 36, 282-284; 2018). The report, 
based on the responses of 2,279 students 
in 26 nations, found that more than 40% 
of respondents had anxiety scores in the 
moderate to severe range, and that nearly 
40% showed signs of moderate to severe 
depression. The high rates suggested by 
this study are alarming, says Teresa Evans, 
a neuroscientist at the University of Texas 
Health Science Center at San Antonio 
and the study’s lead author. She notes 

that students suffering from anxiety or 
depression might have been especially 
motivated to take the survey, which could 
have skewed the results. But she believes 
that the findings underscore the severity of 
the problem and the need for a response. 
Evans adds that universities should provide 
students with training to help them 
manage their time and cope with stress. 


GENDER 
Female authors scarce 


Female authors are scarce in the pages 

of high-impact journals, according to an 
online analysis (Y. A. Shen et al. Preprint at 
bioRxiv http://dx.doi.org/10.1101/275362; 
2018). Researchers at the University of 
Washington in Seattle gathered names of 
first and last authors from papers published 
between 2005 and 2017 in 15 major 
science and neuroscience journals, 
including Nature, Science, Proceedings of 
the National Academy of Sciences (PNAS), 
Nature Neuroscience and Neuropsychology 
Review. Women accounted for roughly 
25% of first authors in Nature and Science 
and just over 35% of first authors in 

PNAS. Female first authors outnumbered 
men — by 53% to 47% — in only one 
journal, Neuropsychology Review. Women 
represented about 15% of senior or 
last-author spots in Nature and Science and 
just under 40% in Neuropsychology Review. 
The study found an inverse relationship 
between the prevalence of female authors 
and the journal's impact factor: the higher 
the impact, the lower the proportion of 
women as first or last authors. Co-author 
Ione Fine, a neuroscientist, and colleagues 
suggest that journals could reduce the 
possibility of bias by adopting mandatory 
double-blind reviews. Nature Research, 
which includes all the Nature-branded 
journals, said that the brand “is committed 
to gender equality and our journals strive 
to support women in science”. 
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STARLESS NIGHT 


BY ANDREW JOHNSTON 


n ephemeral layer of crinkly ice 
A= the surface of the micro- 

phone, falling in glistening flakes 
as Matilda ran a gloved hand over the mic’s 
surface. It was a sturdy device, but there was 
simply no way of knowing if it still func- 
tioned; even the most overzealous engineers 
didn’t bother to plan for such conditions. 
The broadcasting array was still working, 
at least, or that’s what she could glean from 
those instruments that were still functional. 
Matilda had always thought that comput- 
ers were supposed to function best ina cold 
environment, but even a being of wire and 
steel must have its limitations. 

Matilda drew the microphone to her face, 
pulling down her parka just enough to allow 
for speech. “Explorer Nozek, broadcasting 
from Rogue Object X-27881. Cur- 
rent astronomical location 
is unknown. Proximity to 
other objects is unknown. 
Time since last broad- 
cast...” She pushed the 
microphone aside just in time to stifle 
a delirious laugh. “... I guess I don’t know 
how long it’s been. Funny, I hadn't really 
thought about it until just right now, but 
without a star to revolve around ora nice neat 
rotation, these rogue planets dont have ‘time’ 
like we do back on Earth, nice neat cycles that 
we can chart and predict and obsess over. 
Different circumstances, this might be very 
liberating.” 

Matilda buried her face in the parka as 
she fell prey to a violent coughing fit. The 
‘cold’ she had contracted was clearly psycho- 
somatic — she was in a hypochondriac’s 
dark dream, a place so frigid that no patho- 
gen could possibly survive in the air. Noth- 
ing could live here for long, or at least that 
was what Matilda — what everyone on Earth 
— had assumed before she landed. 

“Sorry about that,” said Matilda, idly 
tapping one of the displays. “OK. Current 
temperature is about —230 °C and dropping 
fast as we continue to move away from all 
known stars. Guess I should switch over to 
kelvin soon, that’s ... a pleasant moment, 
when you realize the freezing point of water 
has become grossly inadequate for what 

you are experiencing. 


> NATURE.COM Um... the heat pump 
Follow Futures: is still working, which 
W @NatureFutures is a minor miracle as 
EG go.nature.com/mtoodm it’s really only built for 
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Cold comfort. 


prolonged use in tempera- 
tures some 50 degrees higher than 

this. I expect it'll overload and break soon, 
probably around the time the back-up bat- 
teries give out and —” 

A strange rhythmic tapping sound split the 
silence, a tangle of tiny feet skittering across 
the metal plates in the shadows at the edge of 
the cramped chamber. Matilda shot a quick 
glance out of the corner of her eye at the new 
arrival but didn't halt for more than a second. 

“In concluding what I suspect will be 
my final missive, as ... well, I have no clue 
where I am so there's no chance of anyone 
else finding me ... anyway, I'd just like to say 
that the Rogue Exploration Project was a ter- 
rible idea. We're never going to know how 
many astronauts we've killed on this little 
journey, but I suspect the answer is all of us. 
“Oh no, sending robots isn’t good enough. We 
need human eyes on those planets. Consider 
the possibilities’ | can remember saying this 
myself, both while drunk and stone-cold 
sober, so I guess this is partially on me. If 
you do find me, don’t bother with a funeral 
or service, but go ahead and dig up what’s 
under my ship. Trust me on this.” 

Matilda’s final words vanished into 
another bout of coughing, one that she 


wilfully allowed the microphone 
to pick up — the better to let the 
recipients know what the end 
would look like. Terminating 
the broadcast, she let her head 
roll to one side, staring off into 
the shadows at the edge of the 
chamber. 

“You come to watch me die?” 
said Matilda. “Come on out, 
don't skulk” 

At first, she could see only the 
thing’s deformed-looking eyes, 
those recessed pits that glim- 
mered in the dim light. Then, 
with a series of clicks, itemerged 
into the shimmery light. First 
came the head, triangular and 
with few features aside from 
its eye-pits and a weird sucker- 
mouth whose mechanics were 
beyond knowing. Next came 
its slender body, jet black and 
covered in translucent fibrous 
hairs that turned rigid in time 
with its respiration. The odd 

little creature was supported by 
two asymmetrical lines of spindly 
appendages that clicked almost musi- 
cally as the thing advanced. 
“You alone this time, or did you 
bring your brothers?” Matilda sighed 
and slumped down in her seat, then sud- 
denly grabbed the microphone and yanked 
it forward. “Come on, make a sound. So 
you cant speak, but can't you growl or bark 
or... burble or groan or something? Can't 
you prove that you exist before I die? Wont 
you at least let me be known as the one who 
discovered you?” 

The creature merely cocked its head, 
moving itself up and down on its append- 
ages. It drew closer to Matilda like a curious 
dog, then backed away to size her up. There 
was a faint sound from the bowels of the 
vessel, or rather the absence of a sound that 
Matilda had ceased to notice. 

“That would be the heat pump going.” 
Matilda glanced at the creature as the cold 
enveloped her. “I guess the Universe is 
always going to keep its secrets, huh?” 

Matilda closed her eyes and drifted off to a 
sleep from which she would never wake. The 
creature's eye-pits never left her. = 


Andrew Johnston is a former Pacific- 
hopping ESL teacher currently scrapping 
away in the dark spiritual heart of Kansas. 
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Stockholm is home to several entrepreneurial companies that are reshaping Sweden’s scientific industry. 


fii 


Science diversified 


After suffering heavy blows, Swedish science shows signs 


ergers, acquisitions and restruc- 
turing are par for the course in the 
pharmaceutical industry. Markets 


grow and shrink, new treatments are devel- 
oped and corporate priorities change. Jobs are 
created, moved and lost. This came as no com- 
fort to the 600 people left out of work when 
AstraZeneca shut down its research facility in 
Lund, Sweden, in 2011. As the ripples spread, 
some saw it as the latest in a series of blows 
signalling the death of a local industry. 

Seven years on, and although Sweden's 
pharmaceutical sector has yet to recover, those 
taking a wider view see a more positive pic- 
ture emerging. Major new materials-science 


of sustainable recovery. 
BY NIC FLEMING 


facilities in Lund are expected to open up 
prospects in a wide range of fields. 

“AstraZeneca left a big black hole in a small 
country,’ says Tina Persson, a career coach in 
Stockholm. “My feeling is that things are grow- 
ing again, but it’s a new environment with lots 
of start-ups and innovative small companies. 
Those looking for work in the sector need 
to understand how the culture has changed 
so they can best find where their skills are in 
demand” 

Sweden has a tradition of investing in 
research and development (R&D): it spends 
3.4% of its gross domestic product on science, 
placing it fifth in the world. This helps to 


explain why the country came second (behind 
South Korea) in the 2017 Bloomberg Innova- 
tion Index. Policymakers used to worry that 
this investment was not always matched by 
business and employment creation, says Asa 
Lindholm Dahlstrand, who studies innovation 
at Lund University. However, this seems to 
be changing as Swedes take on board trends 
such as the decline of big pharma, and as the 
country undergoes wider cultural shifts. 

“Tt used to be that most people in Sweden 
expected to find a job for life with a large 
multinational company, and we were relatively 
weak in entrepreneurship,’ says Lindholm 
Dahlstrand. “It has taken along time but the > 
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> culture is becoming more individualistic, 
and we've seen a growing number of innova- 
tive start-ups.’ Sweden's unusual legislation for 
intellectual-property rights is also encouraging 
(see ‘Research rewards). 


DOCTOR, DOCTOR 

One area seeing rapid growth is electronic 
health, or e-health — a broad sector that 
encompasses electronic medical records, 
virtual health-care consultations, health IT 
systems and telemedicine. Sweden is a highly 
connected society: in 2016, 93% of inhabitants 
had Internet access, 79% made online pur- 
chases and 77% had smartphones or portable 
computers — all well above the average for the 
European Union. 

The market for virtual doctors is expanding, 
with two companies — Min Doktor in Malmo 
and KRY in Stockholm — using voice, video 
and text services to cut health-care providers’ 
costs. In March 2017, 13,000 digital medical 
consultations took place in Sweden, which 
runs a public health-care system largely funded 
by taxpayers. “We're seeing rapid growth in 
the number of digital health companies and 
initiatives,” says Nima Jokilaakso, project 
manager for e-health at Swecare in Stockholm, 
a public-private group seeking to boost Swedish 
health-care and life-sciences exports. 

In 2016, Sweden’s government and the 
Swedish Association of Local Authorities and 
Regions adopted a target of making the coun- 
try a global leader in e-health by 2025. Joki- 
laakso, however, says politicians have failed to 
match ambition with action to address e-health 
business concerns. “They want to demon- 
strate they care about the sector, but there is 
no funding or plan connected to the issues that 
are relevant to e-health companies seeking to 
grow.’ These issues include regulatory hurdles 
and political hostility to making profits from 
health care. 

Competition for conventional academic 
and industrial research roles can be stiff, but 
e-health is one of the areas in which doors are 
opening for those willing to be flexible. Com- 
panies offering virtual-doctor and telemedi- 
cine services, for example, need specialists for 
their telephone advice lines — and are there- 
fore hiring medically trained PhD graduates. 
“Ten years ago, PhDs werent needed in those 
positions, but the world is becoming more 
technical,” says Persson. 


SCIENCE CENTRES 

The job market looks different in each of 
Sweden’ three largest cities (Stockholm, Goth- 
enburg and Malmo). In Stockholm, Persson 
says, R&D roles are mostly found in small- and 
medium-sized companies, as well as univer- 
sities. There is also a high demand for those 
with mathematics and physics PhDs to work 
as data scientists and analysts in the finance 
industry. Persson says that a growing number 
of researchers are finding work consulting for 


companies, and others are transferring their 
skills into related roles in sales, marketing and 
regulation. 

AstraZeneca still has an R&D centre in 
Molndal, near Gothenburg, where research- 
ers focus on cardiovascular, metabolic and 
respiratory disease, as well as inflammation 
and autoimmunity. Jobs in the Mélmo-Lund 
area are mostly at Lund University and in 
small businesses and start-ups, such as those 
in e-health and medical technology. The small 
pool of jobs on offer there makes networking 
all the more important. 

“To those who come from abroad who 
say it is hard to get a job, I often explain that 
Sweden is a small country with small, narrow 
networks,” Persson says. “The trick is to make 
sure you are in them.” She suggests taking part 
in Swedish activities, going to meet-ups and 
making full use of alumni, expat and Facebook 
groups. Persson also suggests that prospective 
candidates try to pick up some of the language, 
“even though most Swedes speak English”. 
Although not essential for science, it does 
help with integrating into the local culture. “It 
doesn't have to be perfect, but it is an important 
factor when job hunting, and one often forgot- 
ten by academics.” 


THINKING BIG 

Sweden is expecting more visiting foreign 
researchers in the near future, thanks to the 
development of two world-leading research 
facilities in Lund. MAX IV, which opened in 
2016, is a synchrotron radiation source that, 
when fully operational, will be about 100 times 
brighter than existing facilities. Synchrotrons 
are particle accelerators that use magnets to 
steer electrons around a circular vacuum tube 
at close to the speed of light, causing them to 
radiate high-energy X-rays. 


THE 


DIVERSIFIED 


AND ENTREPRENEURIAL 
LANDSCAPE EMERGING 
OFFERS GREATER 


RESILIENCE. 


Thanks to some clever engineering from 
Mikael Eriksson, a Swedish-born academic 
and pioneer of the previous three MAX 
synchrotrons, MAX IV’s 528-metre-circum- 
ference storage ring can push electrons closer 
together and create brighter X-ray pulses than 
its predecessors. MAX IV also includes another, 
smaller storage ring and a linear accelerator for 
different types of experiment. It is being used 
to study materials in greater detail than ever 
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PROFESSOR’S PRIVILEGE 
Research rewards 


One unusual aspect of academic 
research in Sweden is the long-standing 
law known as professor’s privilege: 
unusually, university employees keep the 
intellectual-property rights to discoveries 
they make. 

This encourages the creation of 
spin-off companies, increases patenting 
activity and can help universities to 
recruit high-quality researchers. “It 
has attracted some good international 
researchers who want to work with 
companies and commercialize their 
work,” says Asa Lindholm Dahlstrand, 
who studies innovation at Lund 
University. N.F. 


before, shining a light on chemical reactions in 
batteries, the precise structure of nanoparticles 
and the mechanisms of new drugs. 

Around 2,500 researchers per year are 
expected to use the facility when it becomes 
operational in 2019. “I think MAX IV will 
do for Sweden and the surrounding region 
what CERN has done for Switzerland,” says 
Soren Bregenholt, vice-president at Danish 
pharmaceutical company Novo Nordisk, refer- 
ring to Europe's particle-physics laboratory 
near Geneva. “It will attract a lot of research- 
ers and develop into a point of gravity for 
materials and biological-materials sciences.” 

Also under construction in Lund is the 
European Spallation Source, which will open 
to users in 2023. The facility will include a 
600-metre-long proton accelerator capable of 
producing neutron beams two orders of mag- 
nitude brighter than those currently available. 
Lund University, Chalmers University of Tech- 
nology in Gothenburg and Uppsala University 
have been recruiting researchers who can use 
these world-leading tools for materials science. 
“We are seeing new positions being opened in 
areas such as magnetic materials and catalysis, 
because universities want to position them- 
selves to exploit these facilities,” says Tomas 
Lundgqvist, science director at MAX IV. 

Few Swedes would have chosen this rocky 
road, with such a major decline for their 
country’s pharmaceutical industry over 
the past 20 years. Despite this, there may be 
long-term gains from the short-term pain: as 
Sweden discovered, reliance on a small number 
of multinationals in changing times can be 
dangerous. The diversified and entrepreneurial 
landscape currently emerging offers greater 
resilience to the country as a whole — and new 
opportunities to those ready to exploit them. = 


Nic Fleming is a freelance writer based in 
Bristol, UK. 
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Q&A Elina Berglund 
Physics to fertility 


Swedish particle physicist Elina Berglund developed a contraception algorithm, based on tracking 
ovulation rates, while working on the Higgs boson at CERN, Europe’ particle-physics laboratory 
near Geneva, Switzerland. Today she lives in Stockholm, where she runs a company that markets 
and sells her Natural Cycles app. 


What led you to develop Natural Cycles? 

I wanted to give my body a break from 
hormonal contraception, because my now- 
husband Raoul and I wanted to have kids ina 
few years. I couldn't find any effective natural 
alternatives; however, I learnt that analysis of 
body-temperature variations can pinpoint fer- 
tility. From this, I developed my own algorithm. 
Both the Higgs discovery and our marriage took 
place in 2012. While on our honeymoon, Raoul 
suggested others might pay to use an app based 
on the algorithm. We launched Natural Cycles 
in 2013 and moved to Stockholm in 2014. 


How does Natural Cycles work and how 
effective is it? 

Women have long used fertility-awareness- 
based contraception methods, but they aren’t 
very effective because ovulation can be difficult 
to predict. Natural Cycles users input daily 
temperature measurements. They can also add 
menstruation dates and hormone-test results. 
The algorithm returns green or red days, telling 
users whether they need to use protection. 

Our recently published large study found that, 
taking human error into account, 7 out of 100 
women per year will get pregnant while using 
Natural Cycles, compared with 9 in 100 for the 
pill and 18 in 100 for condoms. 


Is Natural Cycles just for contraception? 
It can also help to reduce time to pregnancy 


for women trying for a baby. When we started 
trying, I got pregnant first time, partly because 
I knew exactly when I was ovulating. Our 
daughter Alba is now three-and-half. 


Is Sweden a good place for scientists and 
start-ups? 

Scientists in Sweden enjoy good salaries 
and funding, and have a lot of research free- 
dom. It’s a modern and tech-savvy society, 
which is especially good for start-ups. We 
started offin Switzerland, and when we had 
investment rounds we would have to get 
documents printed and go to a notary’s office, 
whereas in Sweden things are quicker and less 
bureaucratic, and more can be done online. 
Things like transportation work well and it’s 
a good climate for innovation. It’s very family- 
friendly because you get 16 months of parental 
leave and the price of day care is minimal. 


What’s the biggest downside? 

In December it gets dark around 2.30 p.m.. 
It can make you feel as though you want to 
hide until the sun comes back out. People 
go out and interact with others less. On the 
plus side, it’s lovely on the first warm day of 
spring when everyone is out in the sun with 
an ice cream. @ 
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This interview has been edited for clarity and style. 
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Crossing borders 


A spirit of collaboration — and an engineering icon — have together 
supported the emergence of a major scientific hub. 


Europe’s longest road and rail bridge. 

Aficionados of crime dramas will forever 
associate the spectacular 7.8-kilometre section 
between Malmé6 and Copenhagen with the 
opening credits of the popular Swedish-Danish 
series The Bridge. For scientists in the region, 
its opening, in 2000, underpinned the develop- 
ment of Medicon Valley, one of Europe’ strong- 
est life-sciences clusters. 

Medicon Valley consists of the island of 
Zealand in eastern Denmark — home to 
Copenhagen, the country’s capital — and 
the Skane region of southern Sweden (see 
‘Science hub’). Its population of 4 million 
amounts to only around one-quarter of the 
combined population of the two countries. Yet 
the valley’s 41,000-strong life-sciences work- 
force accounted for 58% of people working in 
the sector across both nations in 2015. 

The hub is going through a process of change 
against a backdrop of global disruption, creat- 
ing both challenges and opportunities for 
researchers nearby. Some changes are playing 
out differently on either side of the bridge. 

The 2008-15 period, for example, saw 
employment in the life sciences drop by 31% 
in Skane, but jump by 13% in the Danish part 
of Medicon Valley. 

This can largely be explained by events in the 
pharmaceutical industry. Most prominently, 
multinationalgiant AstraZeneca shut its research 
facility in Lund, with the loss of 600 jobs, in 2011. 
Growth on the Danish side has been driven by 


Haters may know @resund Bridge as 
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Novo Nordisk in Bagsveerd, which has around 
17,500 employees in the region, as well as by 
smaller pharmaceutical companies. 


MAKING MEDICINE 

For early-career researchers intending to work 
in the region’s pharmaceutical sector, demand 
for solid computing knowledge has grown with 
the rise of big data — many more businesses 
now need trained number crunchers.“Once, 
statisticians could primarily get work in the 
pharmaceutical industry, insurance and 
finance, but now they are in demand all over, 
so we face more competition to recruit them,’ 
says Erik Kristensen, a human-resources 
executive at Novo Nordisk. 

The company — which produces drugs to 
manage diabetes, obesity and haemophilia — is 
also looking for epidemiologists, pharmacolo- 
gists, protein scientists, medical doctors and 
biologists focused on metabolic disease. 

However, although Medicon Valley’s 
pharmaceutical companies are hiring for 
specific positions, hundreds of jobs have been 
lost in the sector over the past three years. The 
region’s companies are not immune to global 
pressures facing the industry, not least the 
long-term decline in research and development 
productivity. Big pharma’ issues have, however, 
opened up opportunities for others. The bio- 
technology industry took off in the region in 
the 1990s, before the bridge was built. Today, 
there are around 150 biotech companies in the 
region. Europe’ biggest biotech, Genmab, has 


its headquarters and a clinical development 
team in Copenhagen. Founded in 1999, it now 
employs some 250 people and has 2 approved 
immunotherapeutic-antibody treatments. 
Genmab is also working on other antibody 
therapies, and has more than 100 ongoing 
clinical trials for various therapies. 

For Jan van de Winkel, Genmab’s chief exec- 
utive, Medicon Valley is ideally placed thanks 
to the proximity of major Danish pharma- 
ceutical players, and Swedish universities and 
hospitals across the bridge. “Being networked 
and gaining inspiration from others is key to 
Genmabs success,” says van de Winkel. “I see 
avery positive future for Medicon Valley as 
the connections between different parts of the 
life-sciences ecosystem grow stronger still” 

This growing emphasis on collaboration 
and networking is underlined by the rise of 
the region's science parks, where researchers, 
entrepreneurs and established businesses learn 
from each other. Before the bridge and the 
creation of Medicon Valley, the only science 
parks in the area were Ideon in Lund, Medeon 
in Malm6 and Symbion in Copenhagen. Since 
2004, three more have opened. Today, one in 
ten of those working in the life sciences in the 
region is based in a science park. 

The Copenhagen Bio Science Park 
(COBIS) was set up in 2009 to support early- 
stage life-sciences organizations. It currently 
hosts more than 100 start-ups and employs 
around 350 people. COBIS rents out facili- 
ties, offers business-development services 


@resund Bridge, 
joining Sweden and 
Denmark, supports 
a major science hub. 
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and provides early-stage investment funding 
to promising projects. 

“There’s a lot of buzz in the sector right 
now,” says Morten Molgaard Jensen, COBIS 
chief executive. “The number of new projects 
and companies spinning out of institutions is 
on the rise. We offer an environment where 
everyone knows each other and start-ups can 
get established while enjoying cheap offices 
and access to shared lab space.” 


CROSSING OLD BORDERS 

Medicon Valley’s nine universities employ 
nearly 7,000 life-sciences researchers. Of these, 
the University of Copenhagen comes out top 
in terms of international rankings, publication 
volume and citations. As elsewhere, interac- 
tions between academia and industry have 
grown, thanks in part to policy initiatives such 
as the strengthening of technology-transfer 
offices, and wider cultural changes, including 
how academics view industry. 

Nils Briinner, a cancer researcher at 
the University of Copenhagen, is one of 
many academics who have embraced this 
environment. He helped to launch the start-up 
Scandion Oncology in May 2017. The company 
is investigating a drug candidate that has shown 
promise in enhancing the effects of some cancer 
treatments by overcoming drug resistance. 

Briinner says that ten years ago, as an 
academic researcher, you wouldn't want to 
tell people you were partnering with industry 
“because they might think you have a conflict 
of interest. That has changed very rapidly, 
and today academics are proud to work with 
businesses, and if you spin out a company you 
are a hero.” 

Briinner, a Dane born in Sweden, set up an 
oncology-networking organization in 2016 
with the help of the Medicon Valley Alliance 
(MVA), a life-sciences industry body. This 
seeks to drive progress against cancer through 
the sharing of ideas between academics and 
industry, from both sides of the bridge. The 
MVA has also set up similar medical technol- 
ogy and microbiome networks in the region. 


INTERNATIONAL IMPACT 
After the departure of AstraZeneca from Lund 
and several years of decline for life-sciences 
employment in Skane, there are signs of recov- 
ery. The Medicon Village science park, created 
in 2012 at AstraZeneca’s former research facility 
in Lund, now hosts more than 100 businesses, 
including biotech companies. Also in Lund are 
the MAX IV synchrotron radiation facility, 
completed in 2016, and the European Spallation 
Source, an advanced pulsed neutron source 
currently under construction. Next-generation 
materials-science facilities are expected to 
enhance Sweden’s reputation as a leading 
materials-science hub, providing insights 
across a wide range of fields and creating job 
opportunities in nearby universities. 

Those who promote Swedish-Danish 


SCIENCE HUB 


Medicon Valley spans international borders 
with health boration. 
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collaboration acknowledge that life sciences 
still face the challenges in the region. “We have 
everything Boston has,’ says Soren Bregenholt, 
chairman of the MVA and a corporate vice- 
president at Novo Nordisk. “We have top 
universities, great hospitals, pharmaceutical 
companies, biotech, a well-educated workforce, 
a flexible job market and competences that 
stretch from early research to manufacturing. 
However, we are more diluted geographically” 
Life-sciences leaders and politicians are discuss- 
ing new measures to combat this disadvantage, 
such as improved cross-border research and 
development, knowledge exchange, clinical- 
trial collaboration, and coordination of efforts 
to attract investment to the region. 

Another issue is a lack of availability of local 
capital, which can leave small but promising 
companies dependent on investment from 
abroad and vulnerable to foreign acquisition. 
“We have start-ups and grown-ups, but we lack 
the scale-ups,’ says Bregenholt. In 2015, for 
example, Danish oncology company EpiTh- 
erapeutics was snapped up for US$65 million by 
US company Gilead in Foster City, California. 

Large companies in the area also report 
some recruitment difficulties. “People can get 
higher salaries in the United States and Swit- 
zerland,’ says Kristensen. “Because of things 
like free health care and education, good 
transport infrastructure and a good work-life 
balance, however, we find that those that come 
tend to stay.” 

To outsiders, a life-sciences hub that crosses 
a national border and 8 kilometres of cold 
water might seem strange. Yet Skane was part 
of Denmark until the seventeenth century, 
and the two languages are very close. @resund 
Bridge has brought the neighbours closer still. 

“Danes understand Swedes and Swedes 
understand Danes,’ says Briinner. “They have 
more or less the same cultural background. 
Of course, there are differences in regulations, 
research focuses and viewpoints. But it’s being 
exposed to these that opens up possibilities to 
find more collaborators and achieve higher 
standards” = 


Nic Fleming is a freelance writer based in 
Bristol, UK. 
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